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▲  IhMry  of  Diagnostic  Inforonco: 

II.  Jtoilfing  Ckusallty 

It  vu  late  in  middle  age  that  toilers 'a  character,  tonsieur  Jourdain, 
aade  the  surprialng  discovery  that  he  had  been  speaking  prose  all  his  life. 
Similarly,  people  may  be  equally  surprised  to  learn  that  they  have  been 
engaged  in  diagnostic  inference  all  their  lives.  By  "diagnostic  inference" 
we  wean  the  following:  given  the  occurrence  of  a  set  of  outcomes/ results/ 
symptoms,  people  infer  what  causal  process  could  have  produced  the  observed 
effects.  The  essential  aspects  of  such  Inferences  are  that  they  are  causal 
rather  than  correlational,  backward  rather  than  forward  (one  goes  from  effects 
to  prior  causes),  concerned  with  a  specific  rather  than  the  general  case,  and 
constructive  (one  can  synthesise,  enlarge,  or  otherwise  develop  new  hypo¬ 
theses)  .  The  Importance  of  diagnostic  inference  goes  beyond  its  obvious  role 
in  making  sense  of  experience;  it  is  crucial  for  prediction  as  well  as  for 
defining  what  is  a  "relevant"  variable  (Elnhorn  &  Hogarth,  Note  1).  Further¬ 
more,  since  the  evidence  that  one  has  for  making  diagnoses  is  fallible  and/or 
conflicting ,  the  process  takes  place  under  uncertainty.  Thus,  the  essential 
nature  of  Inference,  "going  beyond  the  information  given"  (Bruner,  19S7),  is 
as  true  for  diagnosis  as  for  prediction.  However,  while  much  attention  in  the 
psychological  literature  has  been  devoted  to  prediction  (e.g.,  Kahneman  & 

Tver sky,  1973),  far  less  has  been  given  to  diagnosis. 

In  our  earlier  work  (Elnhorn  &  Hogarth,  Note  1),  a  model  was  developed  to 
describe  how  people  assess  the  strength  of  conflicting  evidence  in  diagnostic 
situations.  Specifically,  given  that  some  event  occurred  (e.g.,  car  accident, 
bank  robbery,  etc.),  we  considered  the  judged  strength  of  particular  causal 
hypotheses  based  on  varying  amounts  of  evidence  for  each.  These  judgments 
ware  modeled  as  resulting  from  an  anchoring  and  adjustment  process  where  the 


evidence  at  hand  formed  the  anchor  and  adjuataenta  were  made  on  the  baste  of 
imagined  contrast  cases.  In  fact,  the  model  was  interpreted  as  implying  that 
people  combine  "what  is"  with  "what  might  have  been"  in  assessing  evidentiary 
strength.  Thus,  the  ability  to  imagine  alternative  scenarios  and  to  engage  in 
counterfactual  reasoning  was  shown  to  be  a  crucial  component  of  diagnostic 
inference,  even  in  simple  situations.  Moreover,  extensions  of  the  model 
demonstrated  how  the  number  and  similarity  of  alternative  hypotheses  affected 
the  strength  of  evidence.  For  example,  imagine  that  there  were  10  witnesses 
to  a  hit-and-run  automobile  accident;  4  witnesses  reported  that  the  offending 
car  was  green  and  6  reported  blue.  On  the  basis  of  this  evidence,  how  likely 
do  you  think  that  the  offending  car  was  green?  Now  imagine  the  situation 
where  4  reported  green,  2  reported  blue,  2  reported  yellow,  and  2  reported 
red.  Would  this  change  your  assessment  that  the  car  was  green?  For  many 
people,  the  different  atructure  of  the  alternatives  implies  differences  in 
evidentiary  strength.  For  Instance,  the  fact  that  four  different  colors  were 
-sported  could  lead  to  the  diagnosis  that  the  viewing  conditions  were  poor 
(fog,  darkness,  etc.),  thereby  reducing  the  likelihood  of  all  hypotheses. 
Similarly,  when  evidence  is  missing  due  to  non-responses,  it  is  often  seen  as 
diagnostic  of  certain  causal  processes.  For  example,  in  assessing  the 
likelihood  of  success  of  job  candidates  on  the  basis  of  letters  of  reference, 
non-responses  were  deemed  to  be  diagnostic  of  the  writer's  unwillingness  to 
say  unflattering  things  about  the  candidate  and  were  therefore  treated  as 
negative  evidence. 

We  interpreted  our  model  and  results  by  drawing  an  analogy  between  the 
processes  of  diagnosis  and  perception.  In  particular:  (1)  The  strength  of 
evidence  was  seen  to  be  the  net  effect  of  the  evidence  one  has  compared  to  the 
evidence  that  could  have  been.  Similarly,  the  salience  of  objects  in  percep- 
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tion  can  ba  viewed  as  the  net  effect  of  figure  against  ground;  (2)  As  In 
perception,  people  are  constantly  engaged  In  diagnostic  Inference  without 
active  awareness;  (3)  Diagnosis  Is  a  constructive  process  In  that  people  bring 
prior  expectations  to  bear  In  Interpreting  Information  and  In  enlarging  hypo¬ 
theses  to  account  for  complex  outcomes.  In  analogous  fashion,  the  Importance 
of  expectations  and  the  constructive  nature  of  "achieving**  the  object  are  well 
established  In  perception  (cf.  Garner,  1966).  Htreover,  the  introduction  of 
expectations  as  central  to  diagnosis  (and  perception)  highlights  the  role  of 
content  knowledge  in  the  assessment  of  evidence  and  raises  questions  of  how 
such  knowledge  Is  used. 

We  take  the  position  that  the  meaning  of  evidence  Is  a  function,  not  only 
of  the  amount  of  evidence  for  or  against  some  position,  but  of  the  causal 
significance  that  people  give  to  that  evidence.  Thus,  judgments  of  causality 
are  crucial  to  diagnosis  since  the  Identification  of  "relevant"  variables 
rests  heavily  on  what  is  seen  as  causally  relevant ,  and  the  judged  strength  of 
relations  between  variables  is  Influenced  by  the  strength  of  their  perceived 
causal  links. 

General  Approach 

We  conceive  of  diagnostic  Inference  as  consisting  of  two  inter-related 
phases:  (1)  determining  what  are  causally  relevant  variables;  and  (2)  assess¬ 
ing  the  strength  of  causal  links.  Our  focus  will  be  on  the  second  phase 
although  we  briefly  consider  the  first.  In  accord  with  our  basic  premise  that 
the  strength  of  evidence  Involves  a  figure/ground  relation,  we  propose  that 
the  strength  of  the  causal  link  between  some  effect  T  and  a  suspected 
cause  Xj  (J  »  1,  2,  n),  Is  given  by, 
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sJy.x^bJ  -  rt.Or.XjlB)  -J^y.xJb)]  (j*k)  (l) 

where, 

SQ(Y,Xj  J  B)  -  net  strength  of  the  csusal  link  of  Y  with  Xj 
conditional  on  background  B. 

e(Y,Xj | B)  -  gross  strength  of  the  causal  link  of  Y  with  Xj 
conditional  on  background  B. 

s(Y,X|Jb)  -  gross  strength  of  alternative  factor  \ 
conditional  on  background  B. 

F  is  a  no no tonic ally  increasing  function* 

There  are  several  important  aspects  of  equation  (1):  (a)  All  terms  are 
conditioned  on  some  assumed  or  implicit  causal  background.  Thus ,  the  strength 
of  any  factor  as  a  cause  of  Y  depends  on  the  context  being  considered.  For 
example,  few  persons  would  attribute  the  cause  of  a  house  fire  to  the  presence 
of  oxygen;  however,  this  could  be  deemed  causally  relevant  in  a  fire  that  took 
place  in  a  manned  space  vehicle.  Therefore,  changes  in  the  assumed  causal 
background  can  lead  to  different  Judgments  of  a  variable's  causal  strength. 

He  consider  this  further  in  the  next  section;  (b)  The  strength  of  a  causal 
link  is  conceptualized  as  its  net  strength;  l.e.,  SQ(Y,Xj|B)  is  a  monotonic 
function  of  the  difference  between  the  gross  strength  of  Xj  minus  the  sum  of 
the  gross  strengths  of  specific  alternatives.  While  we  examine  the  deter* 
minants  of  gross  strength  in  detail  below,  for  now  it  can  be  viewed  as  ana¬ 
logous  to  the  absolute  strength  of  a  signal  perceived  against  a  noiseless 
background.  Thus,  the  net  strength  of  the  Y  ♦  X ^  causal  link  can  be  seen  as 
resulting  from  two  conflicting  forces;  the  strength  of  the  signal  against  the 
competing  signals  that  comprise  imagined  alternative  explanations.  Equation 


(1)  Is  therefore  a  conflict  aodel  of  evidence  that  Is  conceptually  Identical 
to  our  earlier  model  for  assessing  evidentiary  strength  (Einhorn  &  Hogarth, 
Note  1).  Itoreover,  It  should  be  noted  that  equation  (1)  embodies  the 
** discounting  principle"  in  attribution  theory  (Kelley,  1973);  namely,  "the 
role  of  a  given  cause  in  producing  a  given  effect  Is  discounted  if  other 
plausible  causes  are  also  present."  (p.  113);  (c)  Equation  (1)  can  be  viewed 
as  representing  a  multi-stage  anchoring  and  adjustment  process  whereby  one 
anchors  on  the  gross  strength  of  s(T,Xj|b)  and  sequentially  adjusts  for  the 
strengths  of  Imagined  alternatives.  Indeed,  the  case  for  a  multi-stage 
anchoring  and  adjusting  strategy  in  assessing  net  strength  seems  compelling 
since  this  decomposes  the  task  into  a  series  of  smaller,  more  manageable 
components,  thereby  reducing  demands  on  memory  by  using  a  "running  total"  of 
causal  net  strength;  (d)  Equation  (1)  posits  that  net  strength  follows  a 
difference  rather  than  a  ratio  model .  This  has  imp  rtant  implications  for  the 
case  where  few  or  no  alternatives  are  imagined.  For  example,  a  ratio  model 
(such  as  probability  theory)  would  treat  the  strength  of  evidence  for  a 
hypothesis  as  certain  if  there  were  no  alternatives.  However,  in  equation 
(1),  net  strength  can  be  low  when  there  are  no  alternatives  if  the  gross 
strength  of  Xj  is  itself  low.  Itoreover,  net  strength  can  also  be  low  when 
gross  strength  is  high  if  there  are  many  strong  alternatives.  Indeed,  net 
strength  can  only  be  high  if  gross  strength  is  high  and  the  strength  of 
specific  alternatives  is  low. 

To  illustrate  these  implications,  consider  the  following  thought  experi¬ 
ment.  What  is  the  net  strength  of  any  single  explanation  for  the  following 
questions? 

1.  Why  are  the  outer  rings  of  Saturn  braided? 

2.  Why  was  Ronald  Reagan  elected  President  in  1980? 
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3.  The  face  of  a  watch  was  struck  sharply  with  a  hammer. 

Why  did  the  glass  break  and  shatter? 

For  the  first  question,  it  is  difficult  to  generate  a  single  explanation,  let 
alone  any  alternatives,  and  net  strength  is  low.  For  the  second  question, 

there  are  many  alternatives  for  any  explanation  generated  (e.g.,  the  situation 

of  the  economy;  the  rise  of  the  "moral  majority;"  the  unresolved  hostage 

crisis;  Carter's  ineptitude  in  foreign  policy;  etc.).  Thus,  net  strength  in 

any  particular  explanation  is  also  low.  On  the  other  hand,  for  the  third 
question,  only  one  explanation  seems  plausible;  i.e.,  striking  the  watch  face 
with  a  hammer  caused  the  glass  to  break  and  shatter.  Thus,  net  strength  is 
high  since  the  explanation  is  strong  (for  reasons  to  be  considered  later),  and 
it  is  difficult  to  imagine  specific  alternatives.  In  summary,  we  argue  that 
like  good  patterns,  good  explanations  have  few  alternatives  (Garner,  1970); 
or,  to  be  more  precise,  whereas  good  explanations  imply  few  alternatives,  the 
lack  of  alternatives  does  not  imply  good  explanations. 

Plan  of  the  Paper 

Why  are  some  variables  deemed  to  be  causally  relevant  while  others  are 
Ignored?  It  is  argued  that  causally  relevant  variables  are  perceived  as 
differences  in  a  causal  background  (Hickle,  1974)  and  causal  strength  is  a 
function  of  that  background.  An  experiment  is  then  described  that  illustrates 
the  importance  of  background  changes  on  the  judged  strength  of  hypotheses. 
Thereafter,  the  mechanism  by  which  the  strength  of  causal  links  is  assessed 
within  a  background  is  examined  by  discussing  the  concept  of  "cues  to 
causality."  These  cues  are  fallible  indicators  of  causal  relations  and  are 
hypothesised  to  affect  the  gross  strength  of  explanations,  tbreover,  the  cues 
are  categorised  into  two  classes;  those  that  Involve  data  and  those  that  are 


baaed  on  contone.  The  t rode -off  between  these  categories  in  judging  causal 
strength  is  then  discussed  in  relation  to  the  concept  of  spurious  correlation 
and  the  fact  that  causation  does  not  necessarily  laply  correlation.  Following 
this,  the  manner  in  which  both  types  of  cues  are  combined  in  judgments  of 
causal  strength  is  investigated  in  several  experiments,  as  is  the  role  of 
alternative  explanations. 

The  discussion  of  our  experimental  results  centers  on  the  roles  played  by 
the  three  key  concepts  of  our  theoretical  framework:  (1)  the  causal  back¬ 
ground;  (2)  the  cues  to  causality;  and  (3)  the  importance  of  alternative 
explanations.  Finally,  we  discuss  the  normative  Implications  of  our  work  « 
respect  to  three  trade-offs  implicit  in  causal  reasoning. 


The  Diagnostic  Process 

Determining  Causal  Relevance 

Mich  research  Indicates  that  processes  of  perception  and  judgment  are 
sensitive  to  differences  or  deviations  from  present  states,  adaptation  levels, 
and  reference  points  (e.g.,  Belson,  1964;  Kahneman  &  Tversky,  1979).  Mire- 
over,  we  believe  that  this  sensitivity  also  applies  to  the  types  of  events 
that  attract  attention  and  arouse  diagnostic  curiosity.  Specifically,  we 
expect  that  events  of  diagnostic  Interest  are  those  that  are  unusual, 
abnormal,  or  unlikely.  Thus,  one  rarely  seeks  the  cause  of  why  one  feels 
"average,"  why  traffic  flowed  normally,  or  why  some  accident  is  typical.  To 
be  sure,  diagnostic  curiosity  can  be  aroused  vis-a-vis  normal  events.  How¬ 
ever,  we  believe  this  is  most  likely  to  happen  when  those  events  violate 
expectations  and  are  therefore  seen  as  unusual  after  all.  For  example,  we 
might  want  to  know  why  traffic  flowed  normally  if  major  highway  improvements 
were  just  completed,  or  why  we  feel  "average"  after  bearing  about  a  daath  in 
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the  family.  Therefore,  diagnostic  Inference  is  invoked  to  make  sense  of  devi¬ 
ations  via  causal  explanation.  However,  it  is  Important  to  note  that  the 
meaning  of  a  deviation  is  Itself  crucially  dependent  on  some  assumed 
background  or  field.  Indeed,  even  averages  can  be  made  unusual  with  the 
appropriate  shift  of  background — consider  Oscar  Wilde's  statement  that, 
"moderation  shouldn't  be  taken  to  extremes." 

We  propose  that  in  searching  for  a  cause  of  some  outcome  which  is  a 
deviation  from  the  normal  or  average,  attention  is  directed  toward  prior 
deviations  or  abnormal  events.  Thus,  unusual  effects  are  seen  as  the  result 
of  unusual  causal  circumstances.  In  fact,  one  can  consider  this  belief  a 
special  case  of  the  "representativeness"  heuristic  (Kahneman  &  Tversky,  1972) 
In  that  causes  and  effects  are  similarly  discrepant  from  some  assumed  causal 
background.  However,  the  manner  in  which  the  causal  background  affects  the 
strength  of  causal  links  needs  to  be  considered  in  more  detail.  Specifically, 
we  wish  to  show  that  causal  relevance  is  directly  related  to  the  degree  that  a 
variable  is  a  difference-ln-a-background  (Hackle,  1974).  In  order  to  illus¬ 
trate  this,  reconsider  our  example  of  oxygen  being  causally  irrelevant  in  a 
house  fire  but  relevant  in  a  fire  on  a  spaceship.  Furthermore,  in  accord  with 
equat’  «n  (1),  let  Y  ■  fire,  Xj  -  oxygen,  B  •  causal  background  for  the 
house  fire,  and,  C  m  causal  background  for  space  travel.  In  the  house  fire, 
the  gross  strength  of  oxygen  as  a  causal  agent  is  essentially  zero  since  the 
causal  background  B  already  contains  the  presumption  that  oxygen  was 
present.  Thus,  we  set  s(Y,Xj|b)  -  0  since  Xj  is  part  of  B  and  cannot  be 
a  difference  in  that  background.  tereover,  if  gross  strength  is  zero,  net 
strength  will  also  be  zero  since  there  is  no  need  to  consider  alternative 
explanations  to  a  worthless  hypothesis.  This  follows  from  our  interpretation 
of  equation  (1)  as  embodying  an  anchoring  and  adjustment  strategy;  i.e.,  there 
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1*  no  need  for  adjustment  when  gross  strength  is  low.  This  naans  that 
equation  (1)  only  strictly  applies  when  gross  strength  is  sufficiently  high  so 
that  alternatives  are  considered.  How  consider  the  spaceship  fire;  note 
that  s(Y,Xj|C)  is  not  sero  since  oxygen  is  not  part  of  the  causal  background 
of  space  flights.  Indeed,  leaking  oxygen  would  be  an  important  difference-in- 
the-backg round.  However,  this  is  not  to  say  that  oxygen  would  necessarily  be 
a  strong  causal  candidate  since  its  net  strength  would  depend  on  alternative 
explanations.  On  the  other  hand,  it  would  not  be  immediately  dismissed  as 
Irrelevant,  as  in  the  case  of  a  house  fire. 

Although  the  causal  relevance  of  a  single  explanation  can  be  changed  by 
altering  contexts,  we  now  demonstrate  how  the  relative  likelihoods  of  two 
explanations  can  be  reversed  by  a  shift  in  the  causal  background.  Recall  our 
example  in  which  a  watch  face  is  broken  (Y)  by  the  force  of  a  hammer 
(X^) .  Let  the  assumed  causal  background  in  the  statement  of  the  problem  be 
designated  by  B,  and  let  X2  be  the  alternative  explanation  that  the  glass 
of  the  watch  face  was  defective.  In  background  B  (usual  circumstances) ,  we 
expect  the  net  strength  of  Xj  to  be  judged  greater  than  X2;  thus, 
Sq(Y,Xj|b)  >  Sq(Y,X2 | B) .  This  follows  from  equation  (1)  if  it  is  assumed 
that  s(Y,Xj|b)  >  s(Y,X2|b);  i.e.,  the  gross  strength  of  the  force-of-the- 

hammer  explanation  is  greater  than  that  of  the  defective  glass.  Now  consider 
a  change  of  background  to  a  watch  factory  (C)  where  watch  faces  are  tested 
for  defects  by  striking  them  with  a  hammer.  In  such  a  background,  defects  are 
more  of  a  dlfference-ln-the-background  than  the  force  of  a  hammer,  and  we 
expect  that  Sn(Y,Xj|c)  <  Sn(Y,X2|c).  Thus,  our  prediction  is  that  the 
ordering  of  the  net  causal  strengths  of  the  two  explanations  will  reverse  with 
changes  in  the  background. 


In  order  Co  teat  the  above  prediction,  the  following  experiment  was 
performed.  Subjects  (N  ■  67),  recruited  from  the  University  of  Chicago 
community,  were  asked  to  respond  to  varlot**  experimental  stimuli  as  part  of  a 
study  on  how  judgments  and  choicaa  are  made.  Bach  subject  was  asked  to 
respond  to  two  scenarios,  with  a  gap  of  some  40  minutes  between  them  (during 
which  time  other  experimental  tasks  were  administered).  Subjects  were  randomly 
assigned  to  one  of  two  groups  that  received  the  scenarios  in  different 
orders.  The  two  scenarios  wsre  as  follows: 

(1)  A  watch  la  placed  on  a  table — face  upwards.  A  hammer  Is 
then  brought  down  sharply  on  the  face  of  the  watch.  The 
glass  of  the  watch  face  breaks  and  shatters. 

(2)  In  a  watch  factory,  procedures  exist  for  testing  various 
aspects  of  the  end  product.  One  procedure  Is  the  follow¬ 
ing:  A  watch  is  placed  on  a  table — face  upwards.  A 
hammer  is  then  brought  down  sharply  on  the  face  of  the 
watch.  Imagine  that  on  one  occasion  the  glass  of  the 
watch  face  breaks  and  shatters. 

Both  scenarios  were  followed  by  identical  questions: 

Question:  What  caused  the  glass  to  break? 

a.  The  force  of  the  hammer? 

b.  A  defect  in  the  glass? 

c.  Some  other  explanation  (please  specify). 

Please  circle  the  most  likely  cause. 

Results  of  the  experiment  are  presented  In  Table  1,  and  are  shown  for  the 

Insert  Table  1  about  here 

combined  groups  since  the  order  of  presentation  had  no  significant  effect. 

The  table  shows  both  the  marginal  and  joint  distributions  of  responses  to  the 
different  versions  of  the  scenario.  In  the  first  situation  (background  B),  60 
subjects  (91Z)  judged  the  force  of  the  hammer  as  the  most  likely  cause; 
however,  in  the  factory  setting  (background  C),  the  defect  in  the  glass  is 
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seen  as  Che  aost  likely  causal  agent  (36  aubjecte  or  55%).  Moreover,  of  the 
60  subjects  who  said  the  force  of  the  hammer  was  the  aost  likely  cause  In 
background  B,  32  subjects  reversed  the  order  in  background  C.  thus,  the 
prediction  that  the  relative  strength  of  causal  hypotheses  can  be  changed  by 
varying  the  background  was  substantiated  for  a  majority  of  our  subjects. 

In  addition  to  a  complete  change  of  backgrounds,  equation  (1)  suggests 
that  net  strength  can  also  be  altered  by  narrowing  or  widening  the  saae  back¬ 
ground.  This  occurs  because  alternative  hypotheses  are  either  ruled  out  by 
narrowing  the  context  or  expanded  by  widening  It.  In  terns  of  equation  (1), 

changes  in  the  number  of  Imagined  alternatives  is  represented  by  different 

N 

limits  of  siasatlon  In  the  'i  s(Y,X,  |b)  tern.  Therefore,  the  breadth  of  the 

k-1 

background  can  also  affect  the  causal  strength  of  an  explanation.  Por 
exaaple,  consider  the  following  scenario:  Joe  Is  a  chemical  worker  who 
contracts  lung  cancer  and  sues  the  company  for  causing  his  disease.  His 
lawyer  argues  that  the  cancer  rate  of  workers  in  this  factory  Is  nine  times 
the  national  average  for  workers  In  comparable  industries.  Note  that  the 
background  in  this  argument  Is  Industries  of  a  certain  type  and  the  causal 
argument  rests  on  a  difference  (higher  cancer  rates)  In  this  background. 
However,  note  how  this  argument  would  be  strengthened  If  it  could  be  shown 
that  the  cancer  rate  in  this  factory  was  nine  times  the  rate  of  other  chemical 
factorlea  making  exactly  the  same  product.  The  reason  Is  that  by  narrowing 
the  field  to  chemical  plants  making  the  saae  product,  the  number  of  alter¬ 
native  explanations  Is  reduced  thereby  making  the  difference  in  the  narrowed 
field  more  causally  relevant.  Of  course,  this  does  not  preclude  other  classes 
of  alternative  explanations.  Indeed,  the  defense  lawyer  for  the  chemical 
company  might  try  to  show  that  the  worker  has  smoked  cigarettes  for  years, 
comes  from  a  family  with  a  history  of  respiratory  problems,  and  so  on.  These 
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new  alternatives  clearly  reduce  the  net  strength  of  the  cheaical-factory- 
explanatlon,  in  accord  with  equation  (1). 

The  Idea  that  "relevance"  can  be  affected  by  narrowing  or  widening  the 
context  in  which  Information  la  given  has  also  been  discussed  by  Bar— Hillel 
(1980).  She  considers  the  research  showing  that  people  ignore  base  rates 
(prior  probabilities)  in  making  probabilistic  judgments  when  individuating 
information  is  also  present  (see  e.g.t  Kahn amen  &  Tversky,  1973).  For 
example,  imagine  a  hit-and-run  accident  involving  a  cab  and  a  witness  who  says 
the  cab  was  green.  The  witness  is  tested  and  found  to  be  80Z  correct  in 
identifying  cabs  by  their  color.1  Moreover,  85  Z  of  the  cabs  in  the  city  are 
blue  while  15Z  are  green.  When  subjects  are  asked  how  likely  it  is  that  the 
offending  cab  was  green,  the  median  and  modal  response  is  .80,  whereas  the 
Bayesian  answer  is  .41.  However,  Bar-Hillel  states  that  base  rates  will  be 
used  if  they  are  seen  as  "relevant"  to  the  question  asked,  tbreover,  she 
posits  that  two  factors  will  induce  relevance — greater  specificity  and  causal 
lnterpretablllty  of  the  base  rate.  For  example,  people  will  use  the  base  rate 
in  the  cab  problem  if  it  la  stated  that  85Z  of  the  cab  accidents  in  the  city 
involve  blue  cabs  and  15Z  involve  green  (Tversky  &  Kahneman,  1980);  or,  the 
base  rate  is  made  more  specific,  as  in  stating  that,  "in  the  neighborhood  in 
which  the  accident  occurred,  which  is  nearer  to  the  Green  Ckb  Cbmpany  head¬ 
quarters  than  to  the  Blue  Gab  Company,  80Z  of  all  taxis  are  Green,  and  20Z  are 
Blue"  (Bar-Hillel,  1980,  p.  226).  From  our  perspective,  Bar-Hlllel's 
interpretation  of  relevance  is  similar  to  our  concept  of  net  strength. 

Indeed,  both  are  increased  by  a  causal  interpretation  of  evidence  and  a 
narrowed  background  (specificity)  which  reduces  alternatives. 


Components  of  Grosa  Strength:  Cues  to  Causalit 


The  factors  that  comprise  the  gross  strength  of  a  causal  hypothesis  are 
now  considered  in  detail.  Specifically,  we  hypothesize  that  gross  strength 
(conditional  on  an  assumed  background),  is  a  function  of  various  "cues  to 
causality.'*  The  term  "cues'*  has  a  specific  meaning  that  corresponds  with  its 
use  in  Brunswlk's  psychology  (1952;  also  see  Hammond,  1955;  Campbell,  1966). 
Thus:  (1)  The  relation  between  each  cue  and  causality  is  probabilistic.  That 
is,  each  cue  is  only  a  fallible  sign  of  a  causal  relation;  (2)  People  learn  to 
uae  multiple  cues  in  making  inferences  in  order  to  mitigate  against  the  poten¬ 
tial  errors  arising  from  the  use  of  single  cues;  (3)  The  use  of  multiple  cues 
is  facilitated  by  the  intercorrelation  (redundancy)  between  cues  in  the  envir¬ 
onment.  This  both  reduces  the  negative  effects  of  omitting  cues,  and  aids  in 
directing  attention  to  the  presence  of  others;  (4)  Although  multiple  cues 
reduce  uncertainty  in  inference,  they  do  not  entirely  eliminate  it.  Msreover, 
we  categorize  the  cues  into  two  classes;  those  based  on  data  and  those  derived 
from  content  (cf.  Nisbett  &  Boss,  1980,  pp.  97-101).  In  order  to  illustrate 
this  distinction,  imagine  a  dichotomized  causal  candidate  Xj,  and  dichot¬ 
omized  effect  T.  Furthermore,  consider  the  four  possible  outcomes  that  can 
result  (XjDT,  x^flY,  X^Dy).  By  date-based  cues  we  mean  the  actual  or 

perceived  frequencies  of  cases  in  each  of  the  four  cells,  as  well  as  summary 
measures  of  cell  combinations  (such  as  statistical  correlation).  Thus,  data- 
based  cues  reflect  our  dependence  on  general  statistical  relations  in  the 
environment.  On  the  other  hand,  content-based  cues  refer  to  such  factors  as 
temporal  order  of  X  and  T,  contiguity  in  time  and  space,  and  the  similarity 
of  cause  and  effect.  These  factors  depend  on  one's  knowledge  of  a  specific 
X,  T  pair  and  are  not  statistical  in  character.  While  we  discuss  both  data 
and  content-based  cues  below,  we  wish  to  stress  that  judgments  of  gross 


strength  are  a  function  of  both  types  of  cues.  Indeed,  we  will  endeavor  to 
show  that  there  are  Important  trade-offs  between  then. 

Data-based  cues.  The  degree  to  which  one  variable  is  believed  to  be 
predictive  of  another,  denoted  as  perceived  validity,  is  an  Important  cue  to 
causality.  Mote  that  this  cue  nay  be  similar  or  different  fron  the  statis¬ 
tical  concept  of  correlation.  Indeed,  even  in  statistics,  predictive  accuracy 
is  not  a  unitary  concept  that  can  be  defined  without  consideration  of  some 
loss  function  (e.g.,  mean  squared  error  Is  more  appropriate  than  correlation 
when  predicting  exact  values  of  Y).  From  our  perspective,  when  variables  are 
discrete,  perceived  validity  is  a  function  of  4  cues — the  amount  of  constant 
conjunction,  (XflY);  negative  constant  conjunction,  (XflY) ;  and  two  types  of 
dlsconflrmlng  data,  (XflY)  and  (XflY) .  The  manner  In  which  people  combine 
these  cues  in  judging  covariation  has  been  the  subject  of  much  research  (see 
Crocker,  1981  for  a  review).  While  the  results  of  individual  studies  vary.  It 
can  be  concluded  that  constant  conjunction  is  the  major  factor  influencing 
covariation  judgments  while  negative  constant  conjunction  Is  least  Important. 
The  role  of  dlsconflrmlng  data  Is  less  clear;  some  have  argued  that  people 
Ignore  such  information  while  others  have  shown  that  negative  evidence 
significantly  reduces  perceived  covariation.  Some  recent  studies  speak  to 
this  Issue. 

In  a  meta-analysls  of  several  covariation  studies,  Llpe  (Note  2)  showed 
that  all  components  of  perceived  validity,  except  (XflY),  received  significant 
weights  when  subjects'  judgments  were  regressed  on  all  four  data  cues. 
M»reover,  docker  (1982)  has  found  that  the  form  of  the  question  subjects  are 
asked  affects  the  use  of  dlsconflrmlng  evidence.  Of  particular  relevance  to 
this  Issue  is  a  study  that  addressed  how  judgments  of  causal  strength  are 
based  on  the  four  data  cues  discussed  above  (Schustack  &  Sternberg,  1981). 
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Specifically,  subjects'  causal  judgments  were  regressed  on  all  four  types  of 
data  and  results  showed  significant  positive  coefficients  fu”  both  types  of 
constant  conjunction,  and  significant  negative  coefficients  for  disconflralng 
data.  Since  people  can  and  do  use  disconflralng  data  in  Judging  causal 
strength,  by  what  mechanism  does  this  occur?  In  accord  with  our  emphasis  on 
the  importance  of  alternatives,  we  believe  that  disconflrmlng  data  alert 
people  to  the  existence  of  alternative  explanations  without  necessarily 
specifying  what  they  are.  Indeed,  one  source  of  uncertainty  In  assessing 
causal  relations  may  involve  a  general  awareness  of  one's  lack  of  knowledge  of 
specific  causal  factors.  Thus,  our  position  is  that  "knowing  you  don't  know" 
reduces  the  gross  strength  of  a  relation  independently  of  knowledge  of 
specific  alternatives. 

When  variables  are  continuous,  cues  to  perceived  validity  Include  such 
factors  as  the  rate  at  which  X  and  Y  covary,  the  direction  of  the 
relation,  the  absolute  levels  of  the  variables,  and  amount  of  noise  (l.e., 
random  error).  For  example,  when  subjects  are  required  to  learn  linear 
relations  In  single-cue  probability  tasks,  their  performance  varies  as  a 
function  of  the  magnitude  and  sign  of  the  slope,  the  level  of  the  Intercept, 
and  the  amount  of  error  In  the  function  (Brehmer,  1980).  Note,  however,  that 
subjects  in  these  tasks  are  typically  not  required  to  Judge  the  strength  of 
relations;  rather,  they  have  to  learn  the  function  by  making  predictions  and 
observing  outcome  feedback.  Nevertheless,  we  view  the  above  factors  as  cues 
to  relational  strength  in  the  continuous  case  which  play  an  analogous  role  to 
the  four  cues  In  the  dichotomous  case.  In  particular,  the  level  of  random 
error  In  the  former  approximates  the  amount  of  disconflralng  data  In  the 
latter.  In  fact,  our  assertion  that  disconflralng  data  Increase  general 
awareness  of  alternative  hypotheses  Is  supported  by  Brehmer 's  (1980)  results 
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that  subjects  search  for  alternative  hypotheses  when  their  predictions  are 
discrepant  froa  outcoae  feedback  (see  also,  Gettys  &  Fisher,  1979). 

He  also  postulate  that  people  are  sensitive  to  the  extent  to  which 
statistical  ralations  vary  as  a  function  of  other  variables  and  call  this  cue, 
the  robustness  of  a  relation  (see  Toda,  1977).  For  example,  imagine  that 
there  is  a  positive  correlation  between  smoking  and  lung  cancer.  Now  consider 
that  the  correlation  is  computed  separately  for  men  and  women  with  the 
following  result:  The  correlation  is  positive  for  men  but  zero  for  women. 

Note  that  by  dividing  the  original  sample  into  sub-groups,  one  now  considers 
several  relations  and  asks  whether  the  correlation  is  the  same  in  all 
groups.  If  it  is  not,  the  relation  is  not  robust  and  the  causal  relevance  of 
smoking  for  lung  cancer  is  decreased.  On  the  other  hand,  if  a  statistical 
relation  is  robust,  it  points  more  strongly  to  a  causal  relation. 

The  cues  discussed  above  can  be  conceptualized  as  forming  part  of  a  hier¬ 
archical  structure.  At  the  most  general  level,  gross  strength  is  a  function 
of  data  and  content;  l.e.. 


afT.XjlBj-  f (Data,  Content)  (2) 

At  a  second  level,  the  data  component  of  gross  strength  is  a  function  of 
perceived  validity,  (?  [r^]  J »  and  robustness;  thus 

Data  -  g.(T[r  ],  robustness)  (3) 

x  xy 

At  a  more  specific  level,  perceived  validity  is  itself  dependent  on  various 


cues.  That  is 
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1(Xf|Y),  (XflY),  (XDY),  (IflY);  In  Che  dichotomous  case. 

(4) 

rate,  direction,  level,  noise;  In  Che  continuous  case. 

Therefore,  In  asking  causal  judgments,  specific  level  cues  are  combined  Into 
higher  level  units  which  are  themselves  cues  to  the  data  component  of  gross 
strength.  tt>reover,  whereas  we  have  not  specified  the  forms  of  the  functions 
for  combining  cues  at  the  different  levels,  we  note  that  the  functions  assumed 
In  the  studies  quoted  above  are  compensatory.  For  Instance,  Schustack  and 
Sternberg  (1981)  used  a  linear  regression  model  In  which  judgments  of  causal 
strength  resulted  from  the  trade-offs  between  confirming  and  dlsconflralng 
data.  Furthermore,  one  can  Imagine  trade-offa  between  cues  at  higher  levels; 
e.g.,  between  perceived  validity  and  robustness.  Finally,  the  fact  that 
trade-offs  can  occur  at  all  levels  of  the  hierarchy  demonstrates  the  Inherent 
presence  of  conflict  in  diagnostic  inference  (cf.  Einhorn  &  Hogarth,  1981). 

Having  considered  the  components  of  gross  strength  due  to  data,  we  now 
turn  our  attention  to  the  components  of  gross  strength  due  to  content. 

Content-baaed  cues.  We  first  consider  the  cue  of  temporal  order,  which 
Is  used  to  label  which  of  two  variables  In  a  relation  Is  cause  and  which  Is 
effect.  The  Importance  of  temporal  order  seems  obvious;  as,  for  example.  In 
classical  conditioning.  Indeed,  when  the  order  of  presenting  the  conditioned 
and  unconditioned  stimuli  is  reversed,  learning  is  difficult.  Thus,  attempts 
at  "backward  conditioning"  have  generally  been  unsuccessful. 

An  Interesting  feature  of  temporal  order  Is  the  speed  and  facility  with 
which  it  Is  used — often  without  explicit  awareness.  This  is  particularly  the 
case  In  the  Interpretation  of  language  and  can  be  Illustrated  by  contrasting 
ordinary  discourse  with  a  system  that  Is  both  acausal  and  a temporal;  e.g., 
probability  theory.  To  illustrate,  consider  the  conjunction  "and,"  which 
frequently  Implies  temporal  order  In  everyday  English  (Strawson,  1952);  e.g.. 
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be  went  Into  the  supermarket  and  bought  some  coffee.  If  "going  into  the 
supermarket"  and  "buying  some  coffee"  are  represented  by  S  and  K,  respec¬ 
tively,  how  should  one  understand  the  question,  "What  is  the  probability  of  S 
and  K?"  Whereas  a  statistician  would  represent  the  question  as  p(SftK)  and 
Ignore  the  temporal  meaning  of  "and,"  others  may  well  perceive  the  question  as 
formally  requiring  p(K| S) .  Indeed,  to  direct  attention  to  the  conjunction  of 
the  events,  it  might  be  helpful  to  reverse  S  and  K  in  order  to  break  the 
implied  time  order,  i.e.,  "What  is  the  probability  of  buying  some  coffee  (K) 
and  going  into  the  supermarket  (S)?" 

The  following  experiment  demonstrates  this  point.  Forty-eight  master  of 
business  students  (MBA)  at  the  University  of  Chicago  were  the  subjects.  They 
had  all  taken  at  least  one  graduate  level  statistics  course  and  were  randomly 
divided  into  two  groups.  Subjects  in  one  group  were  then  asked  how  they  would 
represent,  in  probabilistic  terms,  the  question,  "What  is  the  probability  that 
Joe  went  into  the  supermarket  (S)  and  bought  some  coffee  (K)?"  The  second 
group  was  asked  how  they  would  represent  the  question,  "What  la  the  prob¬ 
ability  that  Joe  bought  some  coffee  (K)  and  went  into  the  supermarket  (S)?" 
Subjects  In  both  groups  were  asked  to  choose  their  answers  from  four 
alternatives:  p(SflK),  p(s| K) ,  p(K|s),  none  of  the  above.  The  results  in 

the  first  group  were  that  14  subjects  chose  p(SDK),  9  chose  p(K|s),  and 
one  said  "none  of  the  above”  (no  one  chose  p(s|K».  In  the  second  group, 
however,  18  chose  p(8MQ,  no  one  chose  p(k|s),  3  chose  p(s|K),  and  3 

picked  "none  of  the  above."  Thus,  in  the  first  group,  where  "and"  is  consis¬ 
tent  with  the  usual  temporal  order  of  the  events  considered  here,  there  is 
much  confusion  as  to  whether  the  statement  is  one  of  joint  or  conditional 
probability.  In  the  second  group  there  was  less  confusion  although  the  strong 
implication  of  "and"  did  result  in  three  subjects  choosing  p(S|K).  Tversky 
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and  Kahn man  (1980)  have  also  demonstrated  the  effects  of  implicit  temporal 
ordering  on  attempts  to  interpret  the  meaning  of  information  presented  in 
probabilistic  form. 

Our  second  content  cue  is  contiguity,  which  is  composed  of  spatial  and 
temporal  components.  Contiguity  is  important  because  it  aids  in  focusing 
attention  on  what  variables  occurred  close  in  time  to,  and/or  in  the  vicinity 
of,  some  effect  T  (cf.  MLchotte,  1946).  Indeed,  Slegler  has  shown  that  for 
young  children  (5-6  years  old) ,  temporal  contiguity  is  a  very  strong  cue  for 
Inferring  causality  (Slegler  &  Hebert,  1974;  Slegler,  1976).  H>reover,  these 
studies  show  that  older  children  are  less  dependent  on  contiguity  alone,  being 
able  to  make  use  of  multiple  cues.  Nevertheless,  in  the  absence  of  high  con¬ 
tiguity,  variables  that  are  causally  related  may  not  be  noticed  as  important. 
For  Instance,  the  temporal  gap  between  Intercourse  and  birth  requires  some 
knowledge  of  human  biology  and  chemistry  to  maintain  the  links  between  those 
events.  Similarly,  to  connect  the  raising  of  oil  prices  in  the  mld-East  with 
increases  in  the  U.S.  Inflation  cate  necessitates  an  economic  model  to  bridge 
the  spatial  gap. 

Farcelvad  similarity  is  a  fundamental  cue  to  causality.  Indeed,  in 
judgments  of  gross  strength,  perceived  similarity  is  to  content  what  perceived 
validity  is  to  data.  Like  perceived  validity,  perceived  similarity  can  be 
modeled  as  a  function  of  its  elements,  some  of  which  add,  and  some  subtract, 
from  its  strength.  That  is,  following  Tversky  (1977),  similarity  judgments 
can  be  defined  as  a  weighted  linear  function  of  the  common  elements  of  two 
objects  (cf.  constant  conjunction)  minus  the  distinctive  elements  of  each  (cf. 
discontinuing  data).  However,  to  extend  this  conception  of  similarity  from 
objects  to  causes  and  effects,  it  is  necessary  to  specify  the  common  and 
distinctive  elements  of  the  latter.  These  can  be  considered  at  several 
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levels.  First,  there  Is  a  long-standing  notion  that  cause  and  effect  should 
exhibit  som  degree  of  physical  resemblance.  Mill  noted  that  this  is  a  deeply 
rooted  belief  that,  "not  only  reigned  supreM  in  the  ancient  world,  but  still 
possesses  almost  undisputed  dominion  over  many  of  the  most  cultivated  minds'* 
(cited  In  Nisbett  &  Boss,  1980,  p.  115).  ML11  thought  that  such  a  belief  was 
erroneous  and  many  cases  exist  in  which  physical  resemblance  has  been 
misleading.  For  example,  Nisbett  and  Boss  (1980)  point  out  that  physical 
resemblance  was  the  cornerstone  of  a  medical  theory  called  the  "doctrine  of 
signatures"  whereby  cures  for  diseases  were  thought  to  be  marked  by  their 
resemblance  to  the  symptoms  of  the  disease.  Thus,  the  curing  of  jaundice  was 
attributed  to  a  substance  that  had  a  brilliant  yellow  color  (see  also  Shapiro, 
1960;  Shweder,  1977). 

At  a  second  level,  one  can  consider  similarity  based  on  such  elements  as 
the  length  and  strength  of  cause  and  effect.  That  is,  if  the  effect  of 
interest  Is  large  (l.e.,  is  of  substantial  duration  and/or  magnitude),  people 
will  expect  the  cause(s)  to  be  of  comparable  slse.  For  example,  the  germ 
theory  of  disease  advanced  by  Pasteur  must  have  seemed  incredible  to  his 
contemporaries  in  that  people  were  asked  to  believe  that  invisible  creatures 
caused  death,  plagues,  and  so  on.  In  the  same  way,  it  is  equally  difficult 
for  many  to  believe  that  billions  of  dollars  spent  on  social  programs  in  the 
'60s  and  '70s  could  have  had  little  or  no  effect  or  that  long  term  and  complex 
effects  like  poverty  can  have  short  term  and  simple  causes. 

At  a  higher  level  of  abstraction,  elements  of  similarity  judgments  My 
take  on  Mtaphorlcal  significance.  Thus,  in  attempting  to  explain  a  phenome¬ 
non  that  is  not  understood,  a  search  is  made  for  one  that  is,  and  which  has 
similar  features.  Deductions  from  the  latter  are  then  extrapolated  to  the 
fonser .  Bronowskl  (1978)  has  put  this  well  by  noting  that. 
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.  .  .  every  act  of  imagination  is  the  discovery  of  likeness 
between  two  things  which  were  thought  unlike.  And  the  example 
I  gave  was  Newton's  thinking  of  likeness  between  the  thrown 
apple  and  the  moon  sailing  majestically  in  the  sky.  A  most 
improbable  likeness,  but  one  which  turned  out  to  be  (if  you 
will  forgive  the  phrase)  enormously  fruitful.  (Bronowski, 

1978,  pp.  109-110). 

As  was  the  case  for  data-based  cues,  we  view  content-based  cues  as  also 
forming  part  of  a  hierarchical  structure.  At  a  general  level,  the  Importance 
of  content  in  affecting  gross  strength  is  a  function  of  temporal  order, 
contiguity,  and  perceived  similarity;  i.e.. 

Content  -  [ temporal  order,  contiguity,  similarity)]  (5) 

At  a  second  level,  contiguity  is  comprised  of  temporal  and  spatial  components, 

while  perceived  similarity  is  a  function  of  the  common  and  distinctive 

elements  of  cause  and  effect.  Thus, 

Contiguity  •  h.(time,  space) 

2  (6) 

't (Similarity)  -  (common,  distinctive  elements) 

Moreover,  as  discussed  above,  the  elements  of  perceived  similarity  can  be 

characterized  as  relating  to  physical  resemblance,  congruity  of  length  and 

strength,  and  metaphorical  significance. 

The  existence  of  conflict  among  data-based  cues  is  paralleled  by 
conflicts  among  content-based  cues.  For  example,  when  temporal  contiguity  is 
zero,  it  nullifies  the  effect  of  temporal  order  since  there  can  be  no  temporal 
order  for  simultaneously  occurring  events.  However,  whereas  conflicts 
involving  temporal  order  are  usually  resolved  without  difficulty,  this  is  not 
true  of  conflicts  between  perceived  similarity  and  contiguity.  Indeed,  much 
evidence  exists  that  is  relevant  to  the  conflict  between  these  cues.  This 
evidence  can  be  summarized  by  considering  two  levels  of  each  cue  (high  and 
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low)  In  a  2  x  2  Cable,  as  shown  In  Table  2,  First  consider  the  high-high 

Table  2  about  here 

cell,  examples  of  which  are  given  by  classical  conditioning  and  some  of 
MLchotte's  (1946)  demonstrations  of  the  perception  of  causality  induced  by 
moving  objects.  In  particular,  MLchotte's  subjects  reported  perceiving  causal 
relations  when  the  movement  of  objects  after  contact  was  congruent  with  prior 
trajectories  and/or  positions.  On  the  other  hand,  in  the  lower  left  hand  cell 
(high  contiguity,  low  similarity),  there  was  no  perception  of  a  causal 
relation.  Indeed,  MLchotte  noted  that  to  obtain  a  causal  effect,  "requires  a 
certain  degree  of  similarity  between  the  movement  of  the  agent  and  the  change 
shown  in  the  patient,  without  which  the  change  would  not  appear  as  an 
'extension'  of  the  first.  This  is  why  there  is  no  causal  Impression  when  the 
movements  made  by  the  two  objects  are  in  directly  opposite  directions,  or  at 
least  very  different  ones"  (MLchotte,  1946,  p.  210 — citation  translated  by 
authors).  Further  evidence  concerning  this  cell  is  provided  by  the  literature 
that  demonstrates  the  limits  of  classical  conditioning.  For  example,  whereas 
Watson  and  his  colleagues  were  able  to  condition  little  Albert  to  fear  rabbits 
by  pairing  the  appearance  of  a  rabbit  with  that  of  a  large  noise,  they  could 
not  produce  the  same  effect  when  the  rabbit  was  replaced  by  a  block  of  wood  or 
a  cloth  curtain  (Nisbett  &  Ross,  1980,  p.  104).  Seligman  (1970)  reviewed  many 
learning  studies  and  concluded  that  organisms  are  differentially  prepared  to 
learn  different  types  of  relations.  The  extent  to  which  such  limits  are 
biological  and  could  be  overcome  by  experiencing  relevant  environmental 
contingencies  is,  of  course,  controversial.  However,  the  fact  remains  that 
similarity  betweer  cause  and  effect  in  terms  of  congruity  of  length  and 
strength  and/or  physical  resemblance  is  a  crucial  cue,  and  may  often  be 
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Evidence  on  the  coaperison  between  the  lower  left  and  upper  right  cells 
has  been  provided  by  Garcia  and  his  colleagues  (Garcia,  et  al.,  1968;  Garcia, 
et  al.,  1972;  also  see  Garcia,  1981,  for  a  review).  They  have  demonstrated 
that  although  rats  can  learn  to  associate,  after  one  trial,  distinctive 
tasting  food  and  a  gastro-lntestinal  illness  suffered  several  hours  later 
(induced  by  x-rays),  they  do  not  learn  to  associate  a  different  shape  of  food 
with  illness  when  the  taste  is  familiar.  Shultz  and  Ravinsky  (1977)  have 
further  investigated  the  relative  weights  accorded  to  similarity  vs.  temporal 
contiguity  among  children  varying  in  age  from  6  to  12.  For  6-year  olds,  they 
found  that  similarity  outweighed  temporal  contiguity;  however,  the  older 
children  favored  contiguity  over  similarity. 

Finally,  the  conflict  between  similarity  and  contiguity  can  sometimes  be 
resolved  by  distinguishing  between  two  types  of  cause.  In  particular,  con¬ 
sider  the  difference  between  a  "precipitating”  and  "underlying"  cause.  The 
former  is  generally  some  action  or  event  that  is  high  in  temporal  and  spatial 
contiguity  but  low  in  similarity  of  length  or  strength  with  the  event.  The 
latter  is  generally  based  on  high  similarity  of  length  and  strength,  with 
contiguity  being  less  important.  Thus,  the  precipitating  cause  of  World  Iter  I 
was  an  assassination  in  Sarajevo,  but  the  underlying  cause(s)  were  economic 
upheaval,  German  nationalism,  and  so  on. 

Data  and  Content-Based  Cues 

In  real  world  judgments,  data  and  content-cues  are  invariably  present, 
thus  raising  the  question  as  to  how  they  are  combined.  We  now  consider 
empirical  evidence  concerning  this  issue,  but  viewed  within  our  theoretical 
framework. 

First  consider  the  work  on  children's  causal  judgments.  Sedlak  and  Kurtz 
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(1981)  have  reviewed  the  research  on  factors  that  Influence  causal  Inference 

and  focus  on  three  cues:  temporal  order,  constant  conjunction,  and  spatio- 

temporal  contiguity.  Note  that  these  cues  are  three  of  the  necessary  features 

of  the  Humean  doctrine  of  causality  (Cook  &  Campbell,  1979);  moreover,  they 

Involve  content  and  data.  In  summarizing  the  results  of  studies  where  these 

cues  conflict,  Sedlak  and  Kurtz  (1981)  have  stated: 

When  a  temporal  delay  is  Introduced,  only  children  8  years  of 
age  or  older  offer  reliable  evidence  of  covariation.  Specif¬ 
ically,  the  covariation  use  of  younger  children  deteriorates 
when  the  covarying  cause  Is  temporally  non-contiguous  with  the 
effect  and  they  encounter  problems  in  (1)  rationalizing  the 
delay,  or  (2)  recognizing  the  constancy  of  the  delay.  Where 
these  tasks  are  made  difficult,  a  temporally  contiguous  (albeit 
inconsistent)  cause  Is  preferred.  Thus  we  can  tentatively  rank 
order  the  three  Humean  principles.  It  appears  that  temporal 
antecedence  supercedes  spatial  contiguity  (Bullock  &  Gelman, 

1979),  and  that  temporal  (and  spatial)  contiguity  may  sometimes 
outweigh  the  covariation  factor  in  children's  judgments 
(Nsndelson  &  Shultz,  1976;  Siegler,  197S;  Slegler  &  Llebert, 

1974).  (Sedlak  &  Kurtz,  1981,  p.  763). 

Furthermore,  in  studies  that  have  contrasted  covariation  with  similarity, 
Shultz  and  Ravinsky  (1977)  found  that  6-year  olds  were  unwilling  to  label 
dissimilar  factors  as  causes,  even  In  the  presence  of  systematic  covari¬ 
ation.  On  the  other  hand,  older  children  (10-12  years  old)  favored 
covariation  over  similarity. 

In  the  literature  on  adults'  judgments  of  covariation,  the  Importance  of 
content  is  emphasized  by  the  effects  of  labeling  variables.  For  example, 
Jennings,  Amablle,  and  Ross  (1982)  found  that  when  people  viewed  scatterplots 
of  variables  labeled  X  and  Y,  the  statistical  correlation  had  to  be  quite  high 
for  a  relation  to  be  seen.  However,  when  only  given  the  names  of  pairs  of 
variables,  subjects  judged  the  relations  to  have  different  strengths.  Thus, 

In  the  absence  of  data,  the  cues  evoked  by  labeling  variables  affected 
perceived  relational  strength.  The  effects  of  labeling  have  also  been 
examined  in  probability  learning  studies  and  confirm  the  Importance  of 
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content-based  cues  to  causality.  Foe  example,  Adelman  (1981)  found  that 
subjects  in  a  multiple-cue  probability  task  learned  quite  well  when  variable 
labels  were  congruent  with  the  strength  of  the  statistical  relation  but  not 
otherwise.  CSmerer  (Note  3)  showed  that  subjects  were  able  to  learn  a 
dlsordlnal  Interaction  only  when  the  variables  were  labeled  In  accord  with 
prior  beliefs  (this  involved  factors  thought  to  affect  the  price  of  wheat 
futures  In  a  commodity  market).  When  the  same  task  was  given  as  an  abstract 
problem  with  variables  labeled  as  X^ ,  1^,  and  Y,  no  learning  occurred. 

In  a  further  probability  learning  study.  Miller  (1971)  Investigated  the 
effects  of  misleading  cue  labels.  In  the  presence  of  such  conflict,  he  found 
that  statistically  sophisticated  subjects  concentrated  on  the  data  and  Ignored 
the  labels,  whereas  many  other  subjects  adopted  the  reverse  strategy.  Thus, 

In  addition  to  illustrating  the  conflict  between  data  and  content.  Miller's 
(1971)  study  also  demonstrated  that  differences  due  to  training  may  affect  how 
such  conflicts  are  resolved. 

The  conflict  between  cues  that  reflect  content  and  data  can  be  high¬ 
lighted  by  considering  the  concept  of  spurious  correlation  (Elnhorn  &  Hogarth, 
1982).  The  existence  of  this  concept  suggests  that  some  correlations  are  more 
(or  less)  causally  related  than  others,  and  thereby  raises  the  issue  of  how  to 
tell  the  difference  (cf.  Simon,  1954).  For  example,  consider  the  correlation 
between  the  number  of  pigs  and  the  amount  of  pig-iron  (Ehrenberg,  1975).  Such 
a  correlation  seems  spurious  when  the  common  causal  factor,  "economic 
activity,"  Is  considered.  On  the  other  hand,  consider  the  correlation  between 
amount  of  rain  and  number  of  auto  traffic  accidents  in  a  city,  over  the  course 
of  a  year.  Such  a  correlation  does  not  seem  spurious  (or  at  least,  less 
spurious).  What  Is  the  difference  between  these  two  cases? 
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If  we  make  use  of  the  cues  to  causality ,  the  spuriousness  of  the  cor¬ 
relation  between  pigs  and  pig-iron  becomes  apparent.  That  is,  although  the 
data  point  to  a  statistical  relation,  the  content  cues  point  away  from  a 
causal  relation;  specifically,  temporal  order  (which  cannot  be  used  to  specify 
which  variable  is  cause  or  effect,  thereby  also  nullifying  temporal 
contiguity);  low  spatial  contiguity  (it  being  unlikely  that  farms  and  fac¬ 
tories  are  in  close  physical  proximity);  and  similarity  of  the  variables  is 
only  with  respect  to  their  names.  Indeed,  the  judgment  that  the  relation  is 
spurious  is  made  easily  and  quickly. 

Now  consider  the  second  case:  assuming  that  the  data  point  to  a  statis¬ 
tical  relation,  note  how  the  content  cues  reinforce  that  link.  The  temporal 
order  of  rain  and  accidents  is  clear;  contiguity  is  high  both  for  time  and 
space;  and  similarity,  via  the  use  of  prior  knowledge  about  the  effects  of 
slippery  roads,  is  high.  There  seems  less  doubt  that  the  correlation  is 
"real." 

When  data  and  content  conflict,  spurious  correlation  is  not  the  only 
outcome;  e.g.,  a  low  or  zero  statistical  correlation  could  mask  a  true  causal 
relation.  To  illustrate,  imagine  that  we  were  ignorant  as  to  the  cause  of 
birth;  However,  it  has  been  suggested  that  sexual  intercourse  is  related  to 
pregnancy  and  the  following  experiment  was  designed  to  test  this  hypothesis: 
100  couples  were  allocated  at  random  to  an  Intercourse  condition,  and  100  to  a 
non-intercourse  condition.  As  indicated  in  Table  3,  25  females  became  preg- 

Table  3  about  here 

nant,  and  175  did  not.  In  light  of  our  current  knowledge  (but  unknown  to  our 
hypothetical  selves),  we  can  state  that  the  5  people  in  the  no-intercourse/ 
yes-pregnancy  cell  represent  "measurement  error,"  l.e.,  faulty  memory  in 
reporting,  lying,  etc.  Since  the  statistical  correlation  is  small  (r  •  0.34), 
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we  sight  question  whether  the  hypothesis  Is  worth  pursuing.  Indeed,  If  the 
sample  else  were  smeller,  the  eorreletlon  might  not  even  be  “significant.” 
tt>reover,  even  with  a  significant  correlation,  r  »  0.12,  which  Is  hardly  a 
compelling  percentage  of  the  Y  variance  accounted  for  by  X. 

There  are  two  Important  Implications  of  this  example.  First,  whereas 
statistics  texts  correctly  remind  us  that  correlation  does  not  necessarily 
imply  causation,  the  imperfect  nature  of  this  cue  to  causality  is  also 
reflected  In  the  statement:  causation  does  not  necessarily  Imply  cor¬ 
relation.  He  have  somewhat  facetiously  labeled  examples  of  the  latter  as 
"causalatlons,”  giving  them  equal  standing  with  the  better-known  and  opposite 
concept  of  spurious  correlation.  Second,  causalation  demonstrates  that  sole 
reliance  on  statistical  measures  for  understanding  and  Interpreting  data  is 
insufficient,  thereby  highlighting  the  role  of  judgment  (see  also  Simon, 

1954).  tbreover,  specification  of  the  cues  to  causality  provides  some  hint  as 
to  the  components  of  such  judgments. 


The  Role  of  Alternative  Explanations 

The  role  of  alternative  explanations  in  determining  the  net  strength  of  a 
causal  relation  was  explicated  in  equation  (1).  M>reover,  as  stressed  by 
Campbell  and  colleagues  (Campbell  &  Stanley,  1963;  Campbell,  1969;  Cbok  & 
Campbell,  1979),  causal  strength  should  be  evaluated  by  the  ruling  out  of 
alternative  explanations.  In  fact.  Mickle  (1974)  states  that  the  primitive 
notion  of  a  cause  Involves  asking  oneself  the  question:  "Would  Y  have 
occurred  If  X  had  not?”  The  greater  the  number  of  alternative  explanations 
underlying  a  "yes”  answer,  the  lower  the  causal  relevance  of  X  for  Y.  Note 
that  the  posing  and  answering  of  the  above  question  (the  "counterfactual 
conditional”)  may  either  Involve  doing  a  real  experiment  or  what  Is  called  a 


"thought"  experlaent.  In  the  former ,  one  comperes  the  effect  of  X  on  Y 
with  that  of  X  on  Y  (the  control  group  condition).  In  this  way,  the 
counterfactuel  question  is  easily  answered.  Moreover,  note  that  a  control 
group  allows  one  to  infer  that  X  is  the  only  difference  in  the  field  that 
can  affect  Y  (since  the  control  groupie  the  field).  However,  even  in  real 
or  quasi-experiments ,  alternative  explanations  can  exist  if  other  cues  to 
causality  are  ambiguous.  For  example,  Campbell  (1969)  points  out  that:  (e) 
the  gradual  introduction  of  some  experimental  change  makes  the  determination 
of  its  causal  Impact  more  difficult  than  a  sharp  introduction,  as  for  example 
when  a  remedial  program  is  phased  in  over  a  long  time  period  rather  than 
implemented  all  at  once;  (b)  unless  replication  of  the  XDY  relation  is 
accomplished  by  the  Introduction  of  X  over  multiple  tine  periods  or  over 
different  units  within  the  same  time  period,  evidence  of  constant  conjunction 
is  weak;  (c)  the  determination  of  causal  relations  when  effects  are  not 
contiguous  in  time  and  space  with  the  manipulated  variables  is  problematic. 

For  example,  variables  that  have  "lagged"  effects  or  cumulative  effects  over 
time  are  difficult  to  Isolate. 

When  real  or  quasi-experiments  are  not  possible,  one  can  nevertheless 

engage  in  the  following  thought  experiment  in  order  to  answer  the  question, 

"Would  Y  have  occurred  if  X  hadn't?":  Imagine  the  world  before  X,  go 

forward  to  where  X  would  occur,  and  then  delete  it  from  the  scenario.  Now 

run  the  scenario  forward  from  that  point  to  see  if  Y  happens  or  not. 

Clearly,  in  such  thought  experiments  the  construction  of  "possible  worlds"  and 

imaginary  scenarios  is  crucial  for  Judging  causal  significance  (see  also 

Kahneman  &  Tversky,  1982).  In  fact,  according  to  Mickle  (1974), 

The  key  item  is  a  picture  of  what  would  have  happened  if  things 
had  been  otherwise,  and  this  is  borrowed  from  some  experience 
where  things  were  otherwise.  It  is  a  contrast  case  rather  than 
the  repetition  o£  like  instances  that  contributes  most  to  our 
primitive  concept  of  causation,  (p.  57) 
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The  use  of  alternative  explanations  has  iaportant  implications  for  making 
inferences  in  general  and  for  interpreting  experiments  in  particular.  As 
recognised  by  Hume,  "not  only  in  philosophy,  but  even  in  common  life,  we  may 
attain  the  knowledge  of  a  particular  cause  merely  by  one  experiment,  and  after 
a  careful  removal  of  all  foreign  and  superfluous  circumstances’*  (as  quoted  in 
Mickie,  1974,  p.  25).  As  a  case  in  point,  consider  the  following  one-shot 
case  study  with  a  single  datum:  The  occurrence  of  a  huge  explosion  near  Los 
Alamos,  New  Ifexico,  in  Ally  1945.  No  one  doubted  this  to  be  the  effect  of 
detonating  an  atomic  bomb.  Clearly,  inferring  causality  in  this  poorly 
designed  experiment  was  not  difficult  whereas  assessing  causality  in  the  most 
meticulously  designed  experiments  in  social  science  Is  often  problematic  at 
best.  When  one  considers  why  the  causal  inference  is  so  strong  in  the  bomb 
example,  ask  yourself  the  following  question:  "Would  an  explosion  of  such 
magnitude  have  occurred  if  an  atomic  bomb  had  not  gone  off?"  While  it  is 
possible  to  think  of  alternative  explanations  for  the  explosion,  they  are  so 
unlikely  as  to  be  virtually  non-existent,  tbreover,  note  how  the  other  cues 
to  causality  point  to  a  causal  relation  (temporal  order,  contiguity  in  time 
and  space,  similarity  of  the  "unusualness"  of  effect  and  cause,  and  so  on). 
Therefore,  even  in  one-shot  case  studies  with  no  control  group,  causality  can 
be  Inferred  (see  Campbell,  1975  for  an  illuminating  discussion  of  this  issue). 

Tests  of  the  Theoretical  Model 

Given  the  various  components  of  our  framework,  what  would  constitute  an 
adequate  experimental  test  of  its  validity?  In  order  to  help  answer  this,  we 
have  summarized  our  theoretical  modal  in  the  schematic  representation  shown  in 
Table  4.  The  overall  model  is  presented  as  a  hierarchical  arrangement  of  cues 

Insert  Table  4  about  here 
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at  different  levels  of  generality.  However,  the  hierarchical  structure  Is  not 
necessary  for  the  aodel;  It  Is  simply  a  useful  way  to  represent  the  fact  that 
cues  at  one  level  of  specificity  are  combined  to  form  others.  Table  A  makes 
clear  that  In  addressing  the  question  posed  above,  we  are  confronted  with  the 
Issue  as  to  which  level  or  levels  to  Investigate.  However,  given  that  we  have 
already  reviewed  much  of  the  extensive  research  concerning  aspects  of  levels  1 
and  2,  we  will  focus  our  attention  on  the  less  studied  aspects  represented  by 
levels  3  and  4.  In  particular,  since  most  everyday  judgments  Involve  some 
mixture  of  content  and  data,  the  real  significance  of  this  research  Is  to 
explicate  how  these  components  are  combined.  Furthermore,  given  our  emphasis 
on  the  Importance  of  alternative  explanations  in  affecting  causal  Judgments, 
we  need  to  test  if  the  conceptual  distinction  between  gross  and  net  causal 
strength  can  be  empirically  demonstrated. 

The  decision  to  investigate  the  roles  of  content,  data,  and  alternative 
explanations,  has  advantages  and  disadvantages.  The  chief  advantage  is  that 
it  narrows  the  empirical  work  to  a  manageable  level  while  addressing  Important 
Issues.  However,  this  comes  at  the  cost  of  ignoring  the  many  interesting 
process  details  suggested  by  the  model.  We  are  sensitive  to  this  concern  and 
can  only  state  that  our  model  Invites  a  programmatic  study  of  causal  judgments 
that  we  Intend  to  pursue.  For  now,  however,  we  consider  the  more  general 
issues. 

experimental  Evidence 

We  now  present  two  experiments  that  were  designed  to  demonstrate  the  use 
of  data  and  content-based  cues  to  causality  as  well  as  the  role  of  specific 
alternative  explanations.  In  both  experiments,  subjects  were  first  required 
to  read  short  scenarios  and  then  rate  the  likelihood  that  two  variables  were 
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causally  related.  The  specific  content-based  cues  varied  In  this  task  were 
contiguity  and  perceived  similarity  (defined  operationally  below);  the  data- 
based  cues  were  the  four  dichotomous  data  cues  on  which  judgments  of  perceived 
validity  are  based.  The  role  of  specific  alternatives  was  Investigated  by 
providing  subjects  with  a  particular  alternative  and  then  asking  for  a  re¬ 
assessment  of  causal  strength. 

Our  hypotheses  are:  (1)  variations  in  the  levels  of  data  and  content- 
based  cues  will  affect  net  causal  strength.  Specifically,  higher  levels  of 
the  cues  will  lead  to  judgments  of  stronger  causal  relations;  and,  (2)  con¬ 
sideration  of  specific  alternatives  will  reduce  judged  causal  strength. 
However,  such  reductions  will  vary  according  to  the  gross  strength  of  the 
alternatives.  These  hypotheses  can  be  formalized  as  follows: 

HI:  s(Y,Xj|b)  -  &1[¥(rxy}]  +  B2( contiguity)  +  dissimilarity)]  +  e  (7) 

Note  that  we  have  not  included  a  term  to  represent  interactions  since  we  have 
no  a  priori  theory  regarding  the  nature  of  such  interactions. 

To  test  for  the  effects  of  specific  alternatives,  note  that  the  judged 
causal  strength  of  variable  Xj  can  be  expressed  as, 

«  (T.Xjl*)  -  FtslY.XjjB)  -  v]  (8) 

where,  v  represents  the  gross  strength  of  specific  alternatives  a  subject 
might  generate.  However,  since  we  have  no  control  over  the  subject's  use  of 
Imagination  in  generating  such  alternatives,  we  assume  that  this  is  a  constant 
across  subjects.  When  a  specific  alternative,  X^,  is  provided,  the 
subject's  re-assessment  can  be  written, 


t  *tt*sd**V  VJ-fW.1  »  it1*  hi.  rr- 
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S^Cl.Xj!8)  -  PtsCY.XjjBj  -  v  -  s(Y,xJb)]  (9) 

Thus,  if  we  assuae  that  the  function  F  is  linear,  the  difference  between  the 
two  judgments  Is, 

\  "  Sn2  *  *[*(*.*J»)]  (10) 

In  other  words,  the  difference  between  the  two  judgments  will  be  related  to 
the  gross  strength  of  the  specific  alternative  considered.  In  the  first 
experiment,  subjects  were  given  alternative  explanations  all  of  which  had  high 
gross  strength.  Thus  we  hypothesise, 

H2a:  (Sn  -  S  )  >  0  (11) 

1  2 

In  the  second  and  third  experiments,  half  of  the  alternative  explanations  had 
high  gross  strength,  and  half  had  low  gross  strength.  Thus, 

H2b:  (Sq  -  Sq  )  Is  positively  related  to  s(y,XjJb). 


Experiment  1 

Subjects 

Thirty-two  subjects  participated  in  experiment  1.  They  were  recruited 
through  an  advertisement  In  the  University  student  newspaper  and  were  offered 
$3  an  hour  to  participate  In  an  experiment  on  judgment.  Their  median  age  was 
24,  their  educational  level  was  high  (mean  of  4.4  years  of  post  high  school 
education),  and  there  were  16  males  and  16  females. 
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Stimuli 

The  stiauli  consisted  of  eight  scenarios  varying  in  length  from  100  to 
200  words.  These  concerned:  (1)  The  efficacy  of  accounting  reports  in  a 
chain  of  supermarkets;  (2)  the  study  habits  of  a  graduate  student;  (3)  food¬ 
poisoning  following  a  church  picnic;  (4)  weight-loss  after  attending  a  health 
program;  (S)  the  effects  of  environmental  factors  on  the  health  of  high  school 
students;  (6)  the  playing  schedule  of  a  tournament  tennis  player;  (7)  the 
effects  of  new  school  textbooks  on  academic  performance;  and  (8)  the  relation 
of  diet  to  the  performance  of  marathon  runners.  A  version  of  the  church 
picnic  scenario  is  reproduced  below. 

A  group  of  52  young  adults  is  affiliated  with  a  local  church. 

The  group  was  Invited  to  a  free  picnic  last  Wednesday,  where 
all  the  food  and  beverages  were  provided  by  the  church's 
office.  Thirty-seven  of  the  members  attended.  On  Sunday  many 
of  the  members  were  ill  complaining  of  sore  throats  and  colds. 

There  were  25  group  members  who  were  ill;  22  of  whom  had 
attended  the  picnic  while  J  had  not.  Twenty-seven  of  the 
members  were  not  ill  and  of  these  15  attended  the  picnic  while 
12  had  not. 

How  likely  do  you  think  it  is  that  the  illness  was  caused 
by  food  poisoning? 

BOTE  that  there  may  not  be  a  causal  relationship. 

(Please  mark  an  X  in  the  appropriate  place  on  the  scale  below.) 


0  10  20 

Food  poisoning 
did  not  cause 
the  illness. 


30  40  50  60  70 

There  is  a  50/50 
chance  that  food 
poisoning  caused 
the  illness. 


80  90  100 

Food  poisoning 
was  the  cause 
of  the  illness 


The  warning,  "NOTE  that  there  may  not  be  a  causal  relationship,’'  was 


added  to  avoid  ceiling  effects  in  the  use  of  the  0-100  scale.  That  is,  both 


our  pilot  studies  and  prior  research  (Peterson,  1980;  Lipe,  Note  2)  indicated 
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Chat  subjects  in  an  experimental  situation  tend  to  assume  that  there  is  a 
relation  between  two  variables  if  they  are  asked  to  assess  its  strength. 

Since  our  Interest  lay  in  detecting  differences  between  scenarios  involving 
different  levels  of  the  cues,  the  warning  note  was  added  to  encourage  subjects 
to  use  the  full  range  of  the  scale.  The  stimulus  given  above  represents  a 
case  where  contiguity  was  low  (the  illness  was  four  days  after  the  picnic), 
perceived  similarity  was  low  (sore  throats  and  colds  are  not  similar  to  the 
effects  of  food  poisoning) ,  but  the  correlation  was  high  (see  below) .  For 
this  scenario,  high  values  for  contiguity  and  perceived  similarity  were 
achieved  by  having  the  picnic  take  place  on  Saturday  (i.e.,  one  as  opposed  to 
four  days  before  the  effect),  and  having  "nausea  and  stomach  cramps"  as  the 
symptoms  of  the  illness. 

The  alternative  explanations  in  experiment  1  were  all  designed  to  have 
high  gross  strength.  For  example,  the  alternative  for  the  above  scenario  was, 
"A  flu  virus  was  also  'going  around*  the  neighborhood  at  this  time."  Thus, 
for  this  explanation,  both  contiguity  and  perceived  similarity  were  high. 

Operational  Definitions 

Two  levels,  high  and  low,  of  each  of  the  causal  cues  were  made 
operational  in  the  following  manner.  For  perceived  similarity,  we  first 
created  cause-effect  pairs  that  we  deemed  to  vary  in  similarity.  This  was 
Independently  verified  by  having  subjects  rate  the  similarity  of  cause-effect 
pairs  on  a  0-10  scale.  The  mean  judgments  for  the  high  similarity  pairs  was 
6.7  while  the  mean  for  the  low  similarity  pairs  was  3.1.  Independent  ratings 
were  also  collected  for  judgments  of  similarity  for  specific  alternatives 
(mean  of  7.3).  For  contiguity,  high  and  low  levels  were  simply  defined  by 
their  physical  values  (e.g.,  time  in  days).  For  perceived  validity,  several 
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studies  have  shown  this  cue  to  be  sensitive  to  the  difference  between  confirm¬ 
ing  and  dlsconfirmlng  data  (Crocker,  1981;  Llpe,  Note  2;  Schustack  & 

Sternberg,  1981).  Thus,  In  the  high  condition,  the  ratio  of  confirming  to 
dlsconfirmlng  data  was  set  at  approximately  2  to  1;  in  the  low  condition,  the 
scenarios  contained  equivalent  amounts  of  confirming  and  dlsconfirmlng  data. 

In  fact,  to  avoid  giving  subjects  Identical  patterns  of  data  across  scenarios, 
the  distribution  of  data  in  the  four  dichotomous  cells  was  slightly  varied. 
Statistically,  the  high  validity  condition  can  be  characterized  by  correlations 
between  .33  and  .40,  the  low  condition  by  coefficients  between  .00  and  .10. 

Procedure  and  Design 

Subjects  were  presented  with  a  booklet  containing  the  8  scenarios  as  well 
as  several  other  experimental  tasks.  They  were  instructed  to  work  at  their 
own  pace  and  the  average  completion  time  was  1  hour.  The  8  scenarios  were 
interspersed  with  other  material  to  minimize  carry-over  effects  and  to  provide 
variety  for  the  subjects.  After  reading  each  scenario,  subjects  were  required 
to  fill  out  the  rating  form  shown  above.  Furthermore,  they  were  permitted  to 
make  notes  or  calculations.  After  completing  the  rating,  they  were  presented 
with  a  specific  alterr  tlve  explanation  on  the  following  page.  They  then  re¬ 
rated  the  strength  of  the  original  causal  variable  and  proceeded  to  the  next 
task. 

The  experiment  followed  a  4-factor  wlthin-subjects  design  where  the  first 
three  factors  were  the  causal  cues,  and  the  fourth  factor  contained  the  8 
scenarios  arranged  in  a  La tin-square.  Specifically,  each  subject  rated  8 
different  scenarios,  where  each  scenario  contained  one  of  the  2  x  2  x  2  •  8 
combinations  of  the  cues.  In  order  to  form  an  8  x  8  Latin-square,  4  subjects 
were  randomly  assigned  to  each  of  8  groups.  Since  a  Latin-square  involves  an 
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incomplete  design,  it  should  be  noted  that  one  cannot  test  all  possible  inter¬ 
actions.  However,  since  we  have  no  a  priori  theory  regarding  Interactions, 
this  is  a  minor  limitation  of  the  design.  On  the  other  hand,  since  some 
Interactions  can  be  tested,  we  chose  to  examine  those  of  greatest  potential 
importance . 

Results 

The  results  of  the  experiment  are  shown  in  Tables  5  and  6.  Gbnsider 
Insert  Tables  5  and  6  about  here 

the  main  effects  and  interactions  of  the  cues.  Table  5  shows  that  both 
perceived  similarity  and  validity  are  significant  and  in  the  hypothesized 
direction  (see  Table  6).  However,  there  is  no  effect  for  contiguity. 

Moreover,  there  is  a  small  but  significant  interaction  between  perceived 
similarity  and  validity  such  Chat  high  levels  of  both  factors  have  a  non- 
additive  effect  (see  Table  6).  Furthermore,  there  is  also  a  strong  scenario 
main  effect  as  well  as  two  weaker  interactions  (Table  5).  These  effects  show 
that  the  specific  content  of  the  scenarios  is  not  only  important  in  Itself, 
but  also  affects  the  perception  of  the  causal  cues.  In  particular,  the 
scenario  X  perceived  validity  interaction  is  of  theoretical  Interest  since  it 
supports  the  idea  chat  the  effects  of  data  on  judged  causal  strength  are 
Influenced  by  the  content  in  which  they  occur.  Finally,  our  hypothesis 
concerning  the  effects  of  alternative  explanations  was  strongly  confirmed. 
Specifically,  across  all  scenarios  and  cue-combinations,  the  mean  judgment  of 
causal  strength  dropped  from  34  to  24  after  the  introduction  of  a  specific 
alternative.  Furthermore,  the  reduction  was  statistically  significant  within 
7  of  the  8  scenarios  (p  <  .01). 
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bpttlMnt  2 

Rationale 

There  were  three  objectives  In  conducting  experiment  2:  (1)  to  replicate 

the  findings  of  experiment  1;  (2)  to  Investigate  whether  contiguity  would  be  used 
in  scenarios  that  emphasized  this  cue  more  strongly;  and,  (3)  to  explore  the 
differential  effects  of  strong  vs.  weak  alternatives  on  net  causal  strength. 

Subj  ects 

Thirty-two  subjects  participated  in  experiment  2.  They  were  recruited  in 
the  sane  way  as  subjects  in  experiment  1  and  were  also  paid  $5  an  hour.  Their 
median  age  was  22,  their  educational  level  was  high  (mean  of  3.3  years  past 
high  school),  and  there  were  approximately  as  many  males  as  females. 

Stimuli 

The  stimuli  consisted  of  the  first  four  scenarios  used  in  experiment  1 
(see  above).  However,  each  scenario  was  modified  to  emphasize  the  contiguity 
cue.  For  example,  in  the  picnic  scenario  described  earlier,  the  timing  of 
potential  causes  and  effects  was  made  salient  by  introducing  a  specific  event 
at  the  picnic  (the  performance  of  a  comedian)  and  then  emphasizing  the  time  at 
which  certain  events  occurred  relative  to  this  event.  In  another  scenario, 
two  persons  were  Introduced  who  specifically  discussed  the  issue  of  timing 
between  the  effect  and  potential  cause.  At  the  end  of  each  scenario,  subjects 
were  given  the  same  rating  scale  as  in  experiment  1. 

The  alternative  explanations  in  experiment  2  were  designed  to  have  either 
high  or  low  gross  strength.  The  high  gross  strength  alternatives  ware  the 
same  as  in  experiment  1.  The  low  gross  strength  alternatives  ware  designed  to 
have  low  similarity  between  cause  and  effect.  For  example,  in  the  picnic 
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scenario,  Che  weak  alternative  was,  “The  picnic  attenders  had  also  partici¬ 
pated  in  a  hymn  sing  immediately  after  lunch." 

Operational  Definitions 

All  cue a  were  defined  operationally  as  in  experiment  1.  Mjreover,  the 
mean  rating  of  similarity  for  the  low  gross  strength  alternatives  was  0.9  as 
compared  to  7.5  in  the  high  condition. 

Procedure  and  Design 

Experimental  procedures  were  Identical  to  the  first  experiment.  However, 
the  design  differed  in  the  following  respects:  each  subject  received  4 
scenarios  rather  chan  8,  but  the  8x8  La tin-square  was  maintained  by  having 
each  pair  of  subjects  complete  the  2  x  2  x  2  -  8  combinations  of  the  cues. 
Thus,  we  maintained  the  four-factor,  withln-subjects  La tin-square  design  as  in 
experiment  1.  Furthermore,  in  order  to  test  for  alternative  strength,  2  of  the 
4  scenarios  rated  by  each  subject  contained  strong  alternatives  and  2  con¬ 
tained  weak  ones.  Weak  and  strong  alternatives  were  balanced  across  scenarios 
and  combinations  of  the  causal  cues. 

Results 

The  results  of  experiment  2  are  shown  in  Table  7.  As  can  be  seen,  the 

Insert  Table  7  about  here 

three  main  effects  for  the  cues  are  significant,  as  is  the  main  effect  for 
scenarios.  Furthermore,  the  mean  difference  in  judged  causal  strength  for  the 
high  vs.  low  levels  of  the  cues  is  quite  large;  specifically,  for  perceived 
validity  the  means  are  47  vs.  30;  for  perceived  similarity  the  means  are  45 
vs.  32;  for  contiguity  they  are  43  vs.  34.  Thus,  the  results  for  the  first 
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two  cues  replicate  our  findings  in  experiment  1.  However,  the  significant 
«eemte*v««««iNi0a**MidN*aa*anm*hmMi*MmaMamefcMamriedama*aati*amm* 

result  for  contiguity  illustrates  that  this  cue  can  be  used  in  combination 
with  the  others  if  it  is  made  sufficiently  salient.  Therefore,  we  consider 
these  results,  together  with  those  of  experiment  1,  as  supporting  the 
hypothesis  that  gross  strength  can  be  expressed  as  a  weighted,  additive 
function  of  both  data  and  content-based  cues  to  causality. 

The  results  for  specific  alternative  explanations  are  now  considered.  He 
first  examine  each  scenario  to  see  if  judged  causal  strength  decreased  after 
considering  a  specific  alternative  (ignoring  whether  the  alternative  had  a 
high  or  low  gross  strength).  For  two  of  the  four  scenarios,  there  was  a 
significant  decrease  (p  <  .01),  while  a  third  was  marginal  (p  <  .07).  The 
fourth  scenario  showed  no  effect  (this  was  the  same  scenario  that  showed  no 
effect  In  experiment  i).  However,  in  comparing  the  decrease  in  judged  causal 
strength  for  the  good  vs.  poor  alternatives,  we  only  found  a  significant 
difference  in  mean  decrease  in  one  scenario  (p  <  .05).  Thus,  while  hypothesis 
2a  is  supported  in  both  experiments,  hypothesis  2b  is  In  doubt.  In  order  to 
investigate  this  more  fully,  we  performed  a  third  experiment. 

Experiment  3 

Subjects 

Eighty  subjects  participated  in  experiment  3.  They  were  all  MBA  students 
at  the  University  of  Chicago,  enrolled  in  the  basic  graduate  level  statistics 
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Stimuli 

The  stimuli  consisted  of  two  of  the  eight  scenarios  drawn  froa  those  used 
in  the  first  experiment.  The  two  were  characterized  as  having  high  levels  on 
all  three  of  the  causal  cues.  The  reason  for  this  was  to  insure  that  judged 
causal  strength  be  high  enough  so  as  to  make  the  introduction  of  alternatives 
relevant;  i.e.,  if  the  original  factor  has  low  causal  strength,  there  is  no 
need  for  subjects  to  consider  alternative  explanations.  Furthermore,  for  each 
scenario,  there  were  two  possible  alternatives,  one  high  in  gross  strength  and 
the  other  lew.  Moreover,  for  each  scenario,  half  of  the  stimuli  were  paired 
with  the  strong  alternative  and  half  with  the  weak. 

Operational  Definitions 

Since  only  "high"  levels  of  the  cues  were  used  in  this  experiment,  these 
levels  were  set  as  in  the  previous  two  experiments.  Thus ,  a  2:1  ratio  of 
confirming  to  disconfirmlng  data  was  used  for  perceived  validity,  perceived 
similarity  was  Independently  rated  as  high  (mean  of  7.4  on  a  10  point  scale), 
and  contiguity  was  also  high.  The  similarity  ratings  for  the  alternative 
explanations  averaged  8.9  in  the  strong  vs.  1.0  in  the  weak  condition. 

Procedure  and  Design 

Subjects  were  given  booklets  containing  the  two  scenarios  and  they  were 
asked  to  rate  the  causal  strength  of  a  given  factor  on  the  same  100  point 
scale  used  in  the  prior  experiments.  Following  this,  subjects  were  given  an 
alternative  explanation  and  then  re-rated  the  causal  strength  of  the  original 
factor.  After  malting  these  two  ratings,  the  second  scenario  was  considered  in 
the  same  way.  Subjects  were  randomly  assigned  to  one  of  two  conditions:  half 
the  subjects  received  scenarios  paired  with  strong  alternatives,  and  the  other 
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half  received  scenarios  paired  with  weak  alternatives.  In  addition,  the  order 
of  scenario  presentation  was  randomized  across  subjects. 

The  design  of  the  experiment  followed  a  2  *  2  layout  with  one  between- 
subjects  factor  (strong  vs.  weak  alternatives)  and  one  wlthln-subjects  factor 
(two  scenarios).  The  dependent  variable  was  the  difference  between  the 
Initial  rating  and  the  re-assessment.  K  significant  main  effect  for  strong 
vs .  weak  alternatives  (in  the  direction  predicted  by  hypothesis  2b),  would 
mean  that  the  average  decrease  in  the  scenarios  with  strong  alternatives  Is 
greater  than  In  those  containing  weak  alternatives. 

Results 

The  results  of  the  analysls-of-varlance  are  shown  In  Table  8.  The  major 

Insert  Table  8  about  here 

finding  Is  that  the  main  effect  for  strong  vs.  weak  alternatives  Is  highly 
significant.  Indeed,  the  average  decrease  for  strong  alternatives  is  19.2 
(over  both  scenarios) ,  while  the  average  decrease  for  weak  alternatives  is 
only  1.8.  Thus,  hypothesis  2b  Is  strongly  supported  by  these  data. 

Discussion 

This  paper  has  emphasized  the  fundamental  role  of  causal  judgments  In 
diagnostic  Inference.  Furthermore,  we  have  demonstrated  three  crucial  aspects 
of  such  judgments:  (1)  that  causal  judgments  are  made  in  relation  to  a  causal 
background  or  field;  (2)  that  people  use  multiple,  probabilistic  cues  to 
causality  in  forming  their  judgments;  and  (3)  that  the  strength  and 
specificity  of  alternatives  affects  judged  causal  stength.  tfereover,  these 
Ideas  can  be  summarized  by  a  perceptual  analogy  in  which  figures  are  seen 
against  ground  (causal  candidates  are  dif ferences-ln-a-background) ,  good 
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figure*  arc  consistent  with  Gestalt  principles  (good  explanations  arise  from 
Internally  consistent  patterns  of  cues),  and,  good  figures  have  few 
alternatives  (as  do  good  explanations).  Moreover,  these  Ideas  are  embodied  In 
equation  (1),  which  Is  reproduced  here  for  convenience. 

N 

S  (Y,X  |b)  -  P[s(Y,X  |B)  -  l  s(Y,X.  | B) J  (j  *  k)  (1) 

J  J  |c— 1 

Our  empirical  results  and  their  Implications  are  now  discussed  in  light  of 
equation  (1). 

Causal  Background  (B) 

Causal  judgments  are  made  conditionally  on  some  background  and  causal 
candidates  are  seen  as  differences-in-that -background.  However,  the  issue 
arises  as  to  how  the  background  or  field  Is  defined  in  any  given  situation. 

One  Important  factor  concerns  the  expectations  a  person  brings  to  the  infer¬ 
ential  task.  These  expectations,  in  turn,  are  based  on  assumptions  that  one 
may  be  unaware  of.  Consider  our  experiment  concerning  the  breaking  of  a  watch 
face  glass  by  a  hammer.  When  subjects  were  presented  with  this  stimulus,  they 
assumed  a  normal  setting  In  which  the  breaking  of  the  glass  was  attributed  to 
the  force  of  a  hammer.  However,  when  the  context  was  made  explicit,  as  when 
watch  faces  are  tested  In  a  factory,  the  majority  of  subjects  Indicated  that 
the  cause  was  due  to  a  defect  In  the  glass.  Therefore,  in  the  former,  one 
expects  the.  force  of  a  hammer  to  be  sufficient  for  breaking  the  glass,  while 
In  the  latter,  one  doesn't  expect  owners  of  watch  factories  to  employ  testing 
procedures  that  are  financially  ruinous.  Thus,  defects  In  the  glass  are  seen 
as  being  of  greater  causal  relevance. 
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The  Idee  that  assumptions  and  expectations  can  affect  behavior  ia  well 
documented  in  the  psychological  literature.  Recall  ftruner  and  Postman's 
(1949)  classic  experiment  demonstrating  that  subjects  shown  playing  cards  with 
a  red  six  of  spades,  for  example,  reported  seeing  a  purple  six  of  hearts  or 
purple  six  of  spades,  tore  recently,  Loftus  (1979)  has  shown  that  eyewit¬ 
nesses  often  report  what  seem  to  be  compromises  between  what  they  actually  saw 
and  the  presupposition  underlying  the  question  they  are  asked.  Thus,  "leading 
questions"  seem  to  work  by  imposing  an  implicit  background  on  the  unsuspecting 
witness  such  that  the  strength  of  evidence  is  changed.  The  relevance  of  these 
findings  for  causal  judgments  is  direct;  the  causal  background  against  which 
judgments  are  made  is  crucial  for  assessing  causal  strength,  toreover,  the 
fact  that  shifts  in  background  car  and  do  occur,  coupled  with  lack  of  aware¬ 
ness  of  the  background,  suggest  that  disagreement  and  controversy  will  be 
endemic  in  discussions  of  causality.  Indeed,  the  search  for  an  agreed  on 
definition  of  cause,  based  solely  on  structural  aspects  of  inferential  tasks, 
would  seem  difficult  given  the  essential  role  of  context  in  defining  and 
assessing  causal  relations.  Therefore,  as  emphasized  in  equation  (1),  causal 
strength  is  a  relation  <k  .-wen  factors,  and  not  a  thing- in-it self. 

A  second  aspect  concerning  background  effects  Involves  the  narrowing  or 
broadening  of  the  background.  For  example,  consider  the  issue  of  reductionists 
in  causal  explanatlona ,  where  causes  at  a  molar  level  are  different  from  those 
at  a  molecular  level.  If  we  consider  the  background  B  as  analogous  to  the 
field  of  vision  under  a  microscope,  then  shifts  in  magnification  of  the  lens 
define  different  fields,  toreover,  since  causal  relevance  is  a  dlfference-ln- 
the-fleld,  it  is  obvious  that  a  cause  at  one  level  will  not  necessarily  be 
relevant  at  another.  This  microscope  analogy  makes  clear  that  the 
"appropriate"  level  of  magnification  depends  on  one's  purposes  and  the  extent 
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of  on® 'a  knowledge  of  Che  phenomenon  in  question.  Thus,  a  biochemist  may  see 
Che  causal  link  between  smoking  and  lung  cancer  aa  due  Co  chemical  effects  of 
Car,  nicotine,  and  the  like,  on  cell  structure,  while  an  Immunologist  might 
see  the  causal  link  as  due  to  the  suppression  of  the  Immune  system  in 
controlling  diseases  in  general.  However,  it  should  be  noted  that  the  level 
of  the  field  is  not  totally  arbitrary  in  everyday  Inferences.  Indeed,  there 
is  remarkable  consensus  among  Individuals  as  to  the  appropriate  level  of  the 
assumed  background.  On  the  other  hand,  where  large  discrepancies  exist  in 
knowledge  about  a  particular  topic,  as  in  comparing  experts  to  non-experts, 
such  consensus  is  often  lacking. 

Finally,  when  expectations  that  rest  on  an  assumed  background  are 
violated,  this  can  be  an  Important  cue  for  reorganizing  or  re-structuring 
one's  hypotheses.  For  example,  imagine  a  hit-and-run  accident  in  which  all  10 
witnesses  said  the  offending  car  was  going  73  miles  per  hour  at  the  moment  of 
impact.  Since  we  expect  much  greater  variability  in  such  estimates,  as  well 
as  round  numbers,  this  surprising  unanimity  might  cue  one  to  ask  whether  the 
witnesses  had  colluded  in  their  responses.  Similarly,  in  our  earlier  paper 
(Einhorn  &  Hogarth,  Note  1),  we  noted  that  the  structure  of  outcomes  can 
suggest  new  hypotheses  such  that  the  diagnosis  contradicts  the  surface  meaning 
of  the  evidence.  Thus,  scientific  data  that  are  too  perfect  can  suggest  fraud 
(see,  for  example,  Kamln,  1974,  on  Burt's  twin  data;  Bishop,  et  al.,  1975,  on 
Mradel's  pea  experiments) ,  evidence  in  a  trial  that  is  too  consistent  and 
obvious  can  suggest  the  defendant  was  "framed,'*  and  one  can  "protesteth  too 
much"  in  a  variety  of  circumstances.  Such  examples  illustrate  that  violations 
of  expectations  can  trigger  a  re-structuring  of  alternatives.  Of  course, 
specifying  the  conditions  that  lead  to  re-structuring  as  opposed  to  other 
responses  remains  an  important,  and  unanswered  question. 
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Gross  Strength  and  the  Cues  to  Causality 

The  results  of  experiments  1  and  2  show  that  judgments  of  causal  strength 
are  affected  by  the  causal  cues  of  perceived  similarity  and  validity,  as  well 
as  by  contiguity.  While  the  latter  was  not  significant  in  the  first  experi¬ 
ment,  increasing  its  salience  in  the  second  experiment  did  lead  to  its  tu.e. 

Of  course,  we  did  not  test  all  of  the  cues  previously  discussed  since  this 
would  require  many  studies.  However,  in  reviewing  the  psychological 
literature  in  various  sub-fields,  we  pointed  to  several  studies  that  have 
varied  particular  cues,  with  results  that  are  consistent  with  our  framework, 
moreover,  whereas  we  made  no  claim  that  the  particular  cues  that  we-  considered 
are  new,  we  do  claim  that  the  conception  of  these  factors  as  probabilistic 
cues  (in  the  Brunwlklan  sense),  and  their  categorization  as  "content**  or 
"data"  based  variables,  is  new.  Furthermore,  by  considering  causal  judgments 
as  resulting  from  the  weighting  and  combining  of  cues,  "causality"  can  be 
discussed  as  a  psychological  topic  within  the  well-defined  and  researched  area 
of  behavioral  decision  theory,  thereby  directing  attention  to  Issues  that 
might  otherwise  be  ignored.  We  now  consider  some  of  these:  (a)  What  are  the 
ecological  validities  of  the  cues?;  (b)  What  role  does  cue  redundancy  (inter- 
correlation)  play  in  causal  judgments?;  (c)  What  are  the  function  forms  that 
relate  lower  level  to  higher  level  cues? 

(a)  It  has  been  assumed  throughout  this  paper  that  the  cues  to  causality 
have  Imperfect  but  non-zero  ecological  validities;  l.e.,  each  cue  is  predic¬ 
tive  of  a  true  causal  relation.  How  do  we  know  this?  Simply  put,  we  don't. 
The  reason  is  that  without  some  measure  of  "true"  causality,  no  determination 
of  accurate  causal  knowledge  is  strictly  possible.  However,  the  fact  that  the 
cues  we  have  considered  ere  implicated  in  a  wide  variety  of  studies  with  both 


human  and  animal  subjects,  leads  us  to  believe  that  they  would  not  continue  to 
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be  used  If  they  were  useless.  Therefore,  our  argument  Is  a  functional  and 

practical  one;  viz.,  given  the  Importance  of  learning  and  Inferring  causal 

relations  for  survival,  we  would  find  It  surprising  that  the  cues  on  which 

this  depends  are  totally  worthless.  On  the  other  hand,  we  do  not  advocate  the 

position  that  If  something  Is  used,  It  must  be  beneficial  for  the  organism. 

Such  a  view  Is  untenable  for  many  reasons  (see  Elnhorn  &  Hogarth,  1981). 

Thus,  while  we  have  assumed  that  the  cues  to  causality  have  ecological 

validity,  their  Imperfect  validity  can  also  lead  to  superstitions,  myth, 

magic,  and  the  like.  Indeed,  the  relation  between  causal  knowledge  and 

superstition  Is  discussed  in  the  next  section. 

(b)  While  we  have  treated  the  cues  to  causality  as  conceptually 

distinct,  it  seems  likely  that  they  are  correlated  in  the  environment. 

However,  the  determination  of  these  correlations  would  require  an  elaborate 

(and  problematic)  ecological  analysis  that  Is  beyond  the  scope  of  this 

paper.  Nevertheless,  the  assumption  of  correlated  cues  seems  warranted  since 

people  have  strong  expectations  concerning  what  cues  go  together.  Indeed, 

just  as  In  the  perception  of  Incomplete  figures  (where  one  fills  in  the 

missing  parts),  scenarios  are  filled  in  by  assuming  that  cues  not  explicitly 

mentioned  are  In  fact  present.  Thus,  the  fact  that  one  generally  perceives 

the  world  as  coherent,  suggests  that  the  cues  to  causality  are  redundant  to 

some  degree.  Given  this,  an  important  implication  of  redundancy  concerns  its 

beneficial  effects  with  regard  to  simplifying  Information  search  and  the 

combining  of  cues.  For  example,  when  cues  are  redundant, 

(a)  Information  search  Is  limited  without  large  losses  In  predictive 
accuracy;  (b)  attention  is  highly  selective;  (c)  dimensionality  of  the 
Information  space  Is  reduced,  thereby  preventing  Information  overload; 

(d)  intersubstitutability  of  cues  is  facilitated  (Hammond,  1972);  and 

(e)  unreliability  of  cues  Is  alleviated  by  having  multiple  measures  of 
the  same  cue  variable  (Elnhorn,  Kleinmuntz,  &  Kleinmuntz,  1979,  p.  466). 
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Therefore,  although  we  have  posited  several  causal  cues.  It  is  unlikely  that 
all  Mill  be  used  or  attended  to  In  every  situation.  Indeed,  recall  that  the 
contiguity  cue  was  not  used  In  the  first  experiment,  but  was  used  when 
attention  was  directed  to  it  in  experiment  2.  hbreover,  while  correlated  cues 
allow  one  to  fill  in  missing  information  in  making  inferences,  their  less  than 
perfect  redundancy  implies  that  in  "going  beyond  the  Information  given,"  some 
error  is  inevitable.  However,  the  fact  that  inferences  are  often  accurate 
attests  to  the  prevalence  and  functional  value  of  redundant  cues,  (c)  In 
considering  how  cues  are  combined  at  various  levels,  recall  Table  4,  which 
provides  a  schematic  representation  of  our  theoretical  model.  A  question  that 
is  common  to  all  levels  is  the  exact  nature  of  the  functions  that  relate  com¬ 
ponents  on  the  right-hand  side  of  the  equations  to  those  on  the  left.  In 
fact,  this  general  question  is  the  focus  of  much  attention  in  the  judgment  and 
choice  literature  (e.g.,  Anderson,  1979;  Edwards,  1977;  Hammond,  tfcQelland  & 
Mmpower,  1980).  However,  rather  than  considering  each  function  in  Table  4, 
we  consider  the  larger  issue  as  to  whether  the  rules  for  combining  causal  cues 
are  compensatory,  noncompensatory,  or  some  combination  of  both.  That  is,  do 
causal  cues  trade-off  or  not,  and  under  what  conditions? 

In  our  experimental  work,  and  in  all  of  the  research  we  have  quoted,  a 
compensatory  rule  (usually  additive)  was  assumed  for  combining  cues  into 
overall  judgments  of  causal  strength.  As  in  much  other  judgment  research, 
such  rules  have  been  found  to  work  remarkably  well  (Einhorn,  et  al.,  1979). 
Nevertheless,  there  are  reasons  to  suspect  that  certain  causal  cues  may  not 
trade-off  at  all  (e.g.,  temporal  order)  or,  trade-offs  will  only  occur  beyond 
some  threshold  or  minimal  level.  In  particular,  the  cue  of  perceived 
similarity  is  likely  to  be  one  in  which  some  amount  must  be  present  before  any 
causal  relation  is  seen.  However,  given  that  this  level  is  maintained,  trade- 
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off*  of  perceived  similarity  with  other  cues  can  occur  (as  demonstrated  in 
experiments  1  and  2) .  Thus ,  some  combination  of  noncompensatory  and  com¬ 
pensatory  rules  Is  likely  to  be  used  In  conjunction  with  this  cue  (cf.  Luce, 
1978;  Payne,  1976).  Indeed,  we  expect  that  this  pattern  Is  generally  true  of 
the  content-based  cues. 

The  Bole  of  Specific  Alternatives 

Following  the  prediction  implicit  in  equation  (1),  the  provision  of 
explicit  alternatives  with  high  gross  strength  significantly  reduced  judged 
causal  strength  in  experiments  1  and  2.  tbreover,  the  differential  decrease 
for  strong  vs.  weak  alternatives  was  unequivocally  demonstrated  In 
experiment  3.  These  results  are  consistent  with  findings  from  other 
studies.  For  example,  in  the  previously  mentioned  study  by  Schustack  and 
Sternberg  (1981),  subjects  considered  data  relating  to  alternative 
hypotheses.  A  regression  analysis  of  subjects'  causal  judgments  showed  a 
significant  negative  weight  for  the  strength  of  the  alternatives,  thus 
paralleling  equation  (1).  In  a  related  vein,  Koriat,  Lichtenstein,  and 
Flschhoff  (1980)  found  that  when  subjects  generated  specific  arguments  against 
a  position  they  adopted,  their  confidence  in  the  original  position  was 
reduced . 

It  was  stated  earlier  (p.8),  that  we  viewed  equation  (1)  as  embodying  an 
anchoring  and  adjustment  strategy.  Furthermore,  we  stipulated  that  when  gross 
strength  (the  anchor)  is  low,  there  would  be  no  need  to  adjust  for  alternative 
explanations.  After  all,  why  waste  effort  on  a  worthless  explanation?  Data 
from  experiment  1  speak  to  this  point.  Subjects  in  the  high  perceived 
similarity-  high  perceived  validity  condition  (over  all  eight  scenarios)  gave 
an  initial  mean  rating  of  57  for  causal  strength,  as  opposed  to  18  for  the 


versions  containing  low  levels  of  these  cues.  However,  after  seeing  the  sane 
strong  alternative,  the  former  judgments  decreased  to  37  while  the  latter  only 
decreased  to  15  (l.e.,  a  decrease  of  20  vs.  3).  Thus,  strong  alternatives 
"hurt**  strong  explanations  more  that  they  hurt  weak  ones.  While  this  is  not 
surprising  on  an  absolute  basis,  note  that  the  proportional  drop  is  much 
greater  for  the  good  as  opposed  to  the  poor  original  explanations  (35Z  vs. 
18%).  This  result  is  suggestive  that  our  hypothesis  given  above  is  correct. 

It  is  not,  however,  definitive  since  we  forced  people  to  consider  a  specific 
alternative.  In  a  natural  setting,  we  would  expect  that  the  gross  strength  of 
a  causal  candidate  has  to  reach  some  threshold  value  before  people  engage  in 
counter factual  reasoning. 

The  notion  of  a  gross  strength  threshold  emphasizes  the  need  to  specify 
the  form  of  equation  (1)  more  precisely,  as  well  as  the  scale  properties  of 
the  gross  strength  variable.  For  example,  we  have  assumed  that  the  gross 
strengths  of  alternatives  in  our  experiments  were  non-negative  and  indeed,  the 
introduction  of  specific  alternatives  generally  led  to  a  decrease  in  judged 
causal  strength.  However,  explicit  alternatives  could  have  negative  gross 
strength  (by  considering  them  in  relation  to  the  background,  B) ,  which  would 
lead  to  an  Increase  in  the  net  strength  of  a  causal  candidate.  For  example, 
imagine  that  you  believe  strongly  in  a  scientific  theory  for  which  there  are 
few  competitors.  You  are  then  presented  with  an  absurd  alternative 
explanation  which  leads  to  the  following  inference:  if  this  is  the  best 
alternative  that  people  can  generate,  your  belief  in  the  original  theory 
should  be  Increased.  While  the  frequency  of  such  inferences  is  not  known, 
equation  (1)  can  handle  these  reversals  of  net  strength  at  a  conceptual 
level.  However,  it  can  do  no  more  that  this  without  a  better  measure  of  gross 
strength. 


Whatever  the  merits  of  considering  a  gross  strength  threshold,  the 

purpose  of  diagnostic  inference  is  to  generate  some  causal  explanation  for 

observed  effects.  Thus,  while  a  particular  explanation  may  be  poor  and  not 

require  specific  alternatives  for  contrast,  this  does  not  mean  that  the 

process  terminates  at  this  point.  Indeed,  one  is  still  left  with  the 

question,  “If  it  wasn't  X,  what  did  cause  Y?"  Therefore,  while  the  testing  of 

hypotheses  via  comparison  with  alternatives  is  part  of  diagnostic  inference, 

it  also  involves  a  continuing  search  for  better  explanations.  The  distinction 

between  testing  hypotheses  and  searching  for  better  ones  can  be  likened  to  a 

“disconfirmation"  vs.  “replacement'*  model  of  inference.  As  we  pointed  out 

before  (Elnhorn  &  Hogarth,  Note  1),  the  replacement  model, 

....is  consistent  with  the  Kihnlan  notion  that  theories  in  science 
are  not  discarded,  despite  evidence  to  the  contrary,  if  they  are  not 
replaced  by  better  alternatives  (Kuhn,  1962)... We  believe  that  the 
replacement  view  is  equally  strong  in  everyday  Inference. . .Unless 
disconfirmation  leads  to  replacement,  it  can  only  Increase 
uncertainty  and  anxiety.  A  useful  analogy  might  be  the  following: 
how  many  people  would  read  detective  stories  if  the  author  only 
revealed  who  didn't  do  it?  (p.  46) 

Furthermore,  note  that  replacement  is  more  powerful  than  disconfirmation  since 
the  former  can  subsume  the  latter.  For  example,  compare  most  real  trials, 
when  the  accused  is  found  Innocent  because  of  reasonable  doubt  vs.  trials 
depicted  in  movies  (e.g.  Perry  feson),  when  the  accused  is  found  Innocent  by 
unmasking  the  real  culprit.  Such  demonstrations  simultaneously  dlsconfirm  the 
hypothesis  of  guilt  for  the  accused,  while  answering  the  question:  if  not  X, 
then  who? 

Normative  Issues 

Our  theoretical  and  experimental  findings  raise  the  following  question: 
what  should  be  the  role  of  content  and  imagination  in  diagnostic  inference? 

In  particular,  since  there  is  no  agreed-on  theory  of  causality,  is  it  possible 


Co  My  anything  abouC  the  quality  of  diagnostic  inferences  and  the  causal 
Judgments  on  which  they  are  based?  While  we  have  no  definitive  answers,  we 
discuss  three  trade-offs  that  are  germane  to  this  question:  (1)  acquisition  of 
causal  knowledge  vs.  superstition;  (2)  achieving  "order-out-of-  chaos”  vs. 
limiting  creativity;  (3)  using  imagination  vs.  Increasing  uncertainty. 


Causal  knowledge  vs.  superstition 

We  have  asserted  that  the  cues  to  causality  have  ecological  validity  and 
that  accurate  causal  knowledge  depends  partly  on  their  use.  However,  since 
the  cues  are  imperfectly  valid,  the  discovery  of  causal  relations  can  be 
likened  to  a  complex,  multivariate  signal  detection  task  where  the  presence  of 
cause  Is  sought  against  a  background  of  randomness  or  noise  (cf.  Lopes,  in 
press).  There  are  several  implications  of  this  signal  detection  analogy. 
First,  people  must  set  a  cut-off  point  to  decide  whether  or  not  some  factor  is 
to  be  considered  a  cause.  Second,  the  position  of  this  cut-off  will  reflect 
two  types  of  errors  and  their  associated  costs.  That  is,  on  the  one  hand, 
people  can  infer  causes  when  they  do  not  exist;  on  the  other  hand,  they  can 
make  the  error  of  falling  to  infer  true  causal  relations.  M>reover,  whereas 
several  studies  have  addressed  the  former  and  discussed  human  susceptibility 
tc<  "illusions  of  control”  (e.g.,  Langer ,  1973),  there  has  been  less  awareness 
of  illusions  of  lack  of  control  (however,  see  Sellgman,  1975;  Alloy  & 

Abramson,  1979).  Nonetheless,  given  the  importance  of  inferring  causal 
relations  for  survival,  one  could  argue  that  the  former  Illusion  is  less 
costly  than  the  latter.  Indeed,  one  can  consider  superstition  as  the  price 
that  one  pays  for  causal  knowledge  (cf.  Skinner,  1966),  although  it  is  an  open 
question  as  to  whether  the  price  is  worth  the  benefits  in  any  particular 
situation.  Finally,  a  third  implication  of  the  signal  detection  analogy  is 
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Chat  people  nay  exhibit  dll'  i entlal  sensitivity  to  seeing  causal  relations 
through  either  training  (e .g . ,  developing  expertise),  or  ability. 

In  addition  to  using  the  cues  to  causality,  our  position  also  laplles 
that  accurate  causal  judgment  Involves  the  elimination  of  alternative 
explanations.  However,  as  vividly  demonstrated  by  the  concept  of  spurious 
correlation  several  variables  may  be  highly  correlated  in  the  natural  ecology 
so  that  the  determination  of  causal  relations  is  problematic.  Thus,  from  this 
viewpoint  one  can  sympathize  with  the  Ingenuous  teenager  who  asked  Dear  Abby: 
would  she  get  pregnant  from  holding  hands  with  her  boy-friend?  Given  that 
this  causal  candidate  and  the  true  cause  are  both  correlated  and  share  many  of 
the  same  cues  to  causality,  only  a  true  experiment  could  resolve  the  issue. 
Indeed,  the  Importance  of  experiments  for  disentangling  correlated  factors  has 
been  stressed  by  Hammond  (1978).  He  points  out  that  much  learning  through 
experience  often  rests  on  the  weakest  mode  of  inference— unaided  judgment 
based  on  passive  observation.  From  a  normative  viewpoint,  the  prevalence  of 
correlated  alternatives  reinforces  the  need  for  experimentation  In  making 
valid  causal  Inferences  (cf.  Einhom  &  Hogarth,  1978). 

Order  out-of-chaos  vs.  creative  thought 

The  causal  field  and  the  cues  to  causality  both  play  an  Important  role  In 
limiting  the  number  of  interpretations  people  make  In  inferential  tasks,  and 
thus  in  creating  "order-out-of-chaos" .  Furthermore,  the  adoption  of  a 
particular  background  and  the  use  of  the  cues  proceed  quickly  and  are  often 
marked  by  a  lack  of  awareness  that  a  delimiting  process  has  taken  place.  The 
benefits  to  be  gained  from  such  automatized  processes  are  large.  However, 
they  come  at  the  cost  of  reducing  the  probability  that  people  can  achieve  more 
creative  representations  of  inferential  tasks.  Indeed,  Campbell  (1960)  has 
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stressed  the  Importance  of  deliberately  introducing  random  variation  to 
stimulate  creative  efforts,  especially  in  science.  Without  such  random 
perturbations,  he  argues  that  the  forces  that  maintain  a  person's  particular 
conception  of  a  problem  are  too  strong.  Moreover,  the  literature  on 
creativity  has  many  examples  of  techniques  that  are  aimed  precisely  at  making 
people  aware  of  the  delimiting  assumptions  they  bring  to  tasks  (see,  e.g., 
Adams,  1976).  In  addition,  when  using  such  techniques,  people  are  often 
requested  to  refrain  from  counterfactual  reasoning  and  to  make  specific  use  of 
analogies  and  paradox  to  enjoin  previously  disconnected  ideas.  In  short,  to 
restructure  problems  in  creative  ways  frequently  requires  attempts  to  counter 
the  habitual  forces  of  causal  reasoning. 

Doing  imagination  vs.  increasing  uncertainty 

Our  position  is  that  imagination  should  play  an  important  role  in 
diagnostic  inference.  As  noted  above,  imagination  is  clearly  implicated  in 
creative  thought.  M>reover,  tie  find  much  merit  in  using  imagination  via  an 
anchoring  and  adjustment  process  in  the  testing  of  hypotheses.  That  is,  we 
believe  that  comparisons  of  "what  is”  with  "what  might  have  been"  have  many 
benefits.  First,  whereas  the  generation  of  specific  alternatives  to  a  given 
hypothesis  increases  uncertainty  in  that  hypothesis,  such  uncertainty  can  be 
beneficial  for  counteracting  overconfidence  in  judgment.  Indeed,  it  has  been 
argued  that  one  of  the  most  prevalent  cognitive  biases  is  "cognitive  myopia," 
which  results  precisely  from  the  failure  to  imagine  alternative  futures 
(Hogarth,  1981).  Second,  whereas  one  can  disconfirm  a  hypothesis  by  observing 
data,  imagination  is  necessary  for  the  more  important  task  of  replacement  (see 
above) . 


The  use  of  imagination  requires  considerable  mental  effort  and  is  not 
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therefore,  costless.  Moreover,  by  imagining  the  world  ss  it  night  have  been, 
increased  uncertainty  may  lead  to  feelings  of  anxiety,  an  observation  which 
leads  us  to  speculate  that  imagination  may  provide  a  crucial  link  between 
cognition  and  affect  (cf.  Zajonc,  1980).  That  is,  whereas  we  have  limited 
ourselves  to  cognitive  comparisons  between  "what  is"  and  "what  might  have 
been",  such  a  process  may  also  underlie  various  emotional  reactions.  For 
example,  one  might  consider  regret  as  the  difference  between  what  I  did  and 
what  I  might  have  done  (Kahneman  &  Tver sky,  1982);  guilt  as  the  difference 
between  what  I  did  and  what  I  should  have  done;  shame  as  what  I  did  vs.  what 
others  expected  me  to  have  done;  equity  as  what  1  have  vs.  what  others  have; 
and  so  on.  Clearly,  much  work  needs  to  be  done  before  this  suggestion  can  be 
taken  seriously.  However,  we  have  at  least  provided  an  anchor  for  further 
research  on  this  issue. 


Cbadumloa 

We  began  this  paper  by  emphasizing  that  diagnostic  Inference  Involves 
judgments  of  causality  made  under  conditions  of  uncertainty.  Moreover,  such 
Judgments  were  formalized  in  a  model  consisting  of  three  main  components: 
causal  background,  cues  to  causality,  and  specific  alternatives.  Whereas  our 
model  accounts  for  many  findings  in  the  literature  as  well  as  our  own 
experimental  results,  it  by  no  means  explicates  all  aspects  of  causal 
reasoning.  In  particular,  judgments  of  causation  in  the  social  realm  (e.g., 
attribution  theory)  were  not  explored  although  many  of  our  ideas  may  prove 
germane  to  this  topic.  However,  given  the  complexity  of  causal  judgment,  it 
seems  appropriate  to  have  started  with  a  simple  model  based  on  alternatives, 
background,  and  _cues;  i.e.,  the  ABC  of  causality. 


Footnotes 
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TABLE  1 

Effects  of  Shifts  In  the  Causal  Background 


Background  (B) 


Force  of 

hanaer 


Background  (C) 

Defect  In 
glass 

Other 

explanations 


Force  of 
hammer 

Defect  in 
glass 

Other 

explanations 

23 

0 

0 

32 

2 

2 

5 

0 

2 

60 

2 

4 

23 

36 

7 

66 


Note;  One  subject  responded  to  only  one  version  of  the  stimulus  and  is  therefore  excluded 
from  the  analysis. 
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TABLE  2 

Empirical  Evidence  Relating  to  Conflict  between 
Contiguity  and  Perceived  Similarity 


Contiguity 


High 


Low 


Claa8lcal  conditioning 

Garcia  et  al  (1968;  1972) 

High 

MLchotte  (1946) 

Shultz  &  Ravlnsky  (1977) 

♦(Similarity) 

Garcia  et  al  (1968;  1972) 
Hichotte  (1946) 

Low 

Nlabett  &  Rosa  (1980) 

Shultz  &  Ravlnsky  (1977) 
Sellgman  (1970) 

No  relation 
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TABLE  3 

Data  tfatrix  for  Hypothetical 
Intercourse-Pregnancy  Experiment 


25 


5 


2i 


TABLE  5 

Analysis  of  Variance  for  Experiment  1 


Source  of  variation 

df 

16 

F 

P  < 

Between  subjects 

31 

Groups 

7 

820.78 

<  1 

n.s. 

Subjects  within  groups 

24 

1,687.50 

Within  subjects 

224 

V( similarity)  (A) 

1 

12,762.53 

29.45 

.01 

Contiguity  ( B) 

1 

1,410.94 

3.26 

n.s. 

V< validity)  (C) 

1 

41,692.53 

96.21 

.01 

A  x  C 

1 

2,697.50 

6.22 

.05 

B  x  C 

1561.09 

1.30 

n.s. 

Scenarios  (D) 

7 

2,145.56 

4.95 

.05 

D  x  A 

7 

1,397.28 

3.22 

.01 

D  x  C 

7 

1,040.65 

2.40 

.05 

D  x  A  x  C 

7 

620.10 

1.43 

11  efl  • 

Error  (within) 

191 

433.37 

i 

L 


T 


^(similarity) 
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TABLE  6 

Ms  an  Ratings  of  Causal  Strength  by  Perceived 
Validity  and  Similarity: 

Experiment  1 


(validity) 


High 

Low 

High 

58 

26 

42 

Low 

37 

19 

27 

47 


22 


34 


TABLE  7 

Analysis  o£  Variance  for  Experiment  2 


Sources  of  variation 

df 

IS 

F 

P  < 

Between  subjects 

31 

Groups 

7 

1059.98 

1.25 

n.s 

Subjects  within 

groups 

24 

848.84 

Within  subjects 

96 

V (similarity) 

(A) 

1 

5859.03 

11.91 

.01 

Contiguity 

(B) 

1 

2538.28 

5.16 

.03 

^(validity) 

(C) 

1 

9214.03 

18.97 

.01 

A  x  C 

1 

39.06 

<  1 

n.s 

B  x  C 

1 

342.25 

<  1 

n.s 

3  2072.73 


f 


Scenarios  (D) 

Error  (within) 


88 


492.15 


4.21 


01 
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TABLE  8 


Analysis  of 

Variance 

for  Experiment 

3 

Source  of  Variation 

df 

MS 

F 

P  < 

Between  subjects 

75 

Strength  of  alternative  (A) 

1 

11,515.32 

46.11 

.001 

Subjects  with  groups 

74 

249.72 

Within  subjects 

76 

Scenarios  (B) 

1 

819.80 

3.40 

n.s 

A  x  B 

1 

394.90 

1.64 

n.s. 

B  x  subjects  within  groups 

74 

241.01 
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